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PREFACE 



Quod si tam Celebris est apud omnes gloria Adamantis, atque varia ista 
opuni gaudia, gemmae unionesque, ad ostentationem tantum placent , ut 
digitis colloque cir cumber ant ur; non minori afbciendos speraverim gaudio 
eos, quibus curiositatis conscientia qua m deliciarum est potior, novitate 
corporis alicujus, instar crystalli translucidi, quod ex Islandia nuper ad 
nos perlatum est; cujus tam mira est constitutio, ut haud sciam, num 
alias magis naturae apparuerit gratia. 

Erasmus Bartholinus, Experimenta crystalli islandici disdiaclastici 



Apart from a few objects of our immediate neighborhood (the solar system), all 
the information on the physical phenomena taking place in the Universe comes 
from the radiation that the astronomical objects send into space and that is finally 
collected on earth by telescopes or other instruments. Among the different kinds 
of radiation, electromagnetic waves have by far played the most important role in 
the history of Astronomy probably, it is not unrealistic to say that more than 
99% of our present knowledge of the Universe derives from the analysis of the 
electromagnetic radiation. 

Such radiation contains three different kinds of information, encoded into as 
many physical characteristics typical of any oscillatory propagation phenomenon: 
the propagation direction, the frequency and amplitude of the oscillation, and the 
oscillation direction - or polarization. 

The first one is the most direct and the easiest to measure: the human eye is 
itself a suitable instrument, though of limited accuracy. As time passed, positional 
astronomy became more and more accurate thanks to the invention of the opti- 
cal telescope, the introduction of photographic and digital techniques and, more 
recently, the development of technologies for producing images of a given region 
at different wavelengths via radio, infrared, X-ray and 7 -ray telescopes, often op- 
erating on board of spacecrafts. Our present knowledge of the morphology and 
dynamics of the Universe, and of the different objects of which it is composed 
(from planets to stars, from nebulae to globular clusters, from galaxies to Active 
Galactic Nuclei and to clusters of galaxies) is based on the huge number of such 
observations that have been accumulating during several centuries. 

However, even if very accurate, the measurement of the propagation direction 
of the electromagnetic radiation is inadequate to study other fundamental aspects 
of the physical Universe such as the composition, structure, and evolution of the 
different objects. To this aim, a detailed analysis of the frequency (or wavelength) 
distribution of the energy carried by the electromagnetic radiation is required, 
which was made possible by the invention of the spectrograph. Only through the 
systematic use of spectroscopic methods it has been possible to obtain a direct 
comprehension of the physical mechanisms which govern the equilibrium of stars, 
their birth, evolution and death, and the complicated processes taking place in 
the interstellar medium and in the nuclei of galaxies. Spectroscopy, which is also 
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at the basis of the idea of an expanding Universe, has played such a key role in 
the comprehension of the physical Universe that a new name, astrophysics, was 
introduced in the scientific lexicon to denote the astronomical research based on 
this technique. 

The third, and often neglected, characteristic of the electromagnetic radiation 
is polarization. The earliest studies on polarization were performed around 1670 
by the Danish scientist Erasmus Bartholinus, who was strongly impressed by the 
properties of a newly discovered crystal, the Iceland spar (‘. . . whose behavior is so 
surprising that, as far as I know, never the grace of nature appeared more clearly’) 
and who immediately realized that those properties could prove useful to improve 
human knowledge. 

Since polarization is mainly related to the geometrical aspects of the emission 
process (rather than to its energetics), and since polarization measurements are 
often difficult to perform because of the intrinsic weakness of the signals, the study 
of polarization found its place in the astronomical research with some difficulty. 
But eventually the prediction of Erasmus Bartholinus was fully confirmed: some of 
the most important astronomical discoveries of the 20 th century were made thanks 
to polarimetry - or, more properly, spectropolarimetry. Suffice it to quote the 
discovery of magnetic fields in the sun, stars, and the interstellar medium. 

The first application of spectropolarimetry to the astronomical research dates 
back to 1908 when, using a Nicol prism as a polarizer and a Fresnel rhomb as a 
quarter-wave plate, George Ellery Hale succeeded in taking two spectra of the same 
area of a sunspot in opposite directions of circular polarization. The comparison 
of the spectra showed the presence of the typical signature induced by a strong 
magnetic field, the Zeeman effect. 

Since 1908, things have considerably evolved from the technological point of view. 
Spectropolarimetric observations of the solar spectrum have now attained a sensi- 
tivity level which goes beyond the most optimistic expectations of only two or three 
decades ago. The first dedicated instrument for the measurement of Stokes parame- 
ters profiles in Fraunhofer lines, the ‘mythic’ Stokes-I polarimeter, developed in the 
mid 1970s at the High Altitude Observatory, hardly attained a sensitivity of 1%. 
Nowadays, sensitivity in solar spectropolarimetry has reached the level of 10 -5 for 
spatially unresolved observations, and approximately one order of magnitude less 
for observations at high spatial and temporal resolution. It has to be expected that 
these limits will be rapidly overcome by the next generation polarimeters and that 
the same technologies will be adapted to galactic and extragalactic observations. 

The dramatic increase of polarimetric sensitivity in solar observations has raised 
a serious challenge to the theoretical interpretation. Polarization in spectral lines 
is indeed a subtle phenomenon since, in astroplrysical plasmas, there are several 
physical mechanisms that can generate polarization signatures in line profiles and 
many others that can modify them during the propagation. Some of these mech- 
anisms have been known for a long time from laboratory atomic physics. They 
are just to mention the most remarkable - the Zeeman effect, resonance po- 
larization, and the Hanle effect. Other mechanisms are characteristic of optically 
thick plasmas, and are related to the propagation of radiation in anisotropic media. 
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They are known under the general names of diclrroism and anomalous dispersion, 
though in special cases different denominations are often used (inverse Zeeman 
effect, magneto-optical effects, Faraday rotation, Faraday pulsation, etc.). 

These processes have mostly been studied in specific physical contexts, for differ- 
ent purposes and at different levels of sophistication, and the scientific literature 
on the subject is scattered across books and journals, spanning almost a century 
of active research. For this reason we felt that a book capable of describing, in 
a unique and self-consistent formalism, all the known physical phenomena that 
may affect the polarization signatures of spectral lines, might be welcome to the 
scientific community. The diagnostic content of spectropolarimetry is high, but 
the correct interpretation of the observations requires a full understanding of the 
physics underlying the generation and transfer of polarized radiation. 

The redaction of this book required several years. We might try to say, like Huy- 
gens in the preface to his Treatise on light, ‘The reason is that I wrote it rather 
carelessly in the Language in which it appears, with the intention of translating it 
into Latin, so doing in order to obtain greater attention to the thing , but we feel 
it would be hardly believed. The true reason is that the theory of spectropolarime- 
try is complicated, because it implies some knowledge of several subjects: atomic 
physics, quantum mechanics (with special emphasis on the theory of angular mo- 
mentum and of the density matrix), quantum electrodynamics, radiative transfer 
(both under LTE and Non-LTE conditions). 

Moreover, spectropolarimetry is full of traps: among all the disciplines in as- 
trophysics, there can hardly be found one where more attention has to be payed 
to each single definition, each transformation, each physical application. Sign er- 
rors are especially insidious, as most remarkably shown by the classical example of 
circular polarization in a given wing of a spectral line formed in a magnetic atmo- 
sphere. There are four operations which produce a sign change in such polarization, 
and obviously, there are as many possibilities to make a sign error. To have a sign 
switch, one can: a) invert the direction of the magnetic field; b) interchange the red 
with the blue wing; c) use the opposite definition of positive and negative circular 
polarization; d) consider an emission line instead of an absorption line. This is just 
an example, but it shows very well the subtleties of the subject. We tried to make 
the exposition as clear as possible by using everywhere the same definitions and 
conventions, and by carefully describing all the mathematical developments. 

We hope that this book may be useful to the next generations of scholars that 
will like to enter the field of spectropolarimetry, solar and non-solar. And we hope 
that it may contribute to make this research field more accessible and less hermetic, 
thus attracting more and more scientists to the fascinanting world of the Stokes 
parameters profiles and of their interpretation. 

Arcetri, March 2004 Egidio Landi DeglTnnocenti 

Marco Landolfi 



Christiaan Huygens, Treatise on light (1690), translated by S.P. Thompson, Dover 
Publications, New York, 1962. 
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CHAPTER 1 



DESCRIPTION OF POLARIZED RADIATION 



According to Maxwell’s equations, the electric and magnetic fields associated with 
a radiation beam propagating in vacuum are perpendicular to each other and to 
the direction of propagation. The phenomena of polarization are connected with 
the possibility that is left to the electric field vector (or, alternatively, to the asso- 
ciated magnetic field vector) of spanning the plane perpendicular to the direction 
of propagation. 

The polarization properties of a radiation beam can be described in several dif- 
ferent ways, each of them having its proper advantages and disadvantages. In the 
following we will go through some of these descriptions showing how they are in- 
terrelated and, most of all, how they can be translated into operational definitions 
having a direct physical meaning in terms of measured quantities. 



1.1. The Polarization Ellipse 

We start from the classical (non-quantum) description of the electromagnetic field 
and we refer to the idealized case of a pure monochromatic, plane wave propagating 
in vacuum along the positive 2 -axis of a right-handed coordinate system. A full 
description of the wave can be given either specifying its electric field vector or, 
alternatively, its magnetic field vector, as the two quantities are related by the 
equation 

B{f,t) = nx E(r,t ) , (1.1) 

where n is the unit vector in the direction of propagation, and where E and B are 
both measured in c.g.s. units (the Gauss-Hertz system, that will be used - with 
some few exceptions - throughout the whole book). As far as the cross product 
is concerned, we will follow the almost universally accepted convention that is 
referred to as the right-hand (or screwdriver ) ride, and that is illustrated in Fig. 1.1. 
However, we remind the reader that the sign in Eq. (1.1) is related to the sign 
conventions which have been historically adopted to define the positive directions 
for E and B. These are, in turn, connected with the historical convention of 
adopting the positive sign for the proton charge and, in magnetostatic experiments, 
the positive sign for the idealized monopole which is found at the North-end of a 
dipole oriented in the earth magnetic field (Maxwell, 1873). Note that, according 
to this definition, the earth presently shows a South magnetic pole close to the 
North geographic pole. 

To define the polarization properties of an electromagnetic wave we choose its 
representation in terms of electric field vibrations. Again, there are historical rea- 
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Fig. 1.1. Mutual relationship among instantaneous electric field vector, instantaneous magnetic 
field vector and direction of propagation of an ordinary electromagnetic wave. The cross product 
is defined according to the right-hand convention. 

sons behind this choice; the description in terms of electric vibrations is nowadays 
preferred because polarization measurements are mostly performed, at least in the 
optical spectrum, by means of materials whose interaction with the electromagnetic 
radiation is dominated by the electric vector. However, a description in terms of 
magnetic vibrations would be as appropriate as the former. 

In the right-handed coordinate system introduced above, with its ir-axis pointing 
in an arbitrary direction in the plane perpendicular to the direction of propagation, 
the electromagnetic wave is described by the following expressions 

E x (f, t) = E x cos (kz — tot + </q) 

E y (r, t) = E 2 cos(kz — tot + <j) 2 ) , (1.2) 

where E 1 , E 2 , <j>±, and (f> 2 are four positive constants specifying the amplitudes and 
phases of the electric oscillations, and where k and u> have the usual meaning of 
wavenumber and angular frequency. 

In a given plane that is held fixed in space, for instance in the plane z = 0, the 
electric field vector oscillates according to the equations 

E x (t ) = E 1 cos(wf — </q) 

E y (t ) = E 2 cos (tot - <t> 2 ) . (1.3) 

The tip of the electric field vector rotates in the x-y plane describing an ellipse that 
is called the polarization ellipse and whose characteristic parameters can be found 
with the following algebraic manipulations. 

Let us consider the couple of axes (x'y') that are obtained by rotating the old 
couple ( xy ) through an angle a measured positively from the x-axis to the y-axis 
(see Fig. 1.2). In this new system, with the position 

C = cos a , S = sin a 

we have 

E x ,(t) = (-EjCcosc/q + E 2 S cos (j > 2 ) cos cut 
+ (E-yC sin (j) 1 + E 2 S sin </> 2 ) sin cot 

E , (t) = (~E 1 S cos + E 2 C cos <j) 2 ) cosuit 

+ (—E 1 S sin + E 2 C sin (j> 2 ) sin uit . 



(1.4) 
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Fig. 1.2. Polarization ellipse. 

We now choose the angle a so as to make the x' and y' axes coincide with the 
proper axes of the polarization ellipse, the long axis being directed along x' . If a 
and b are the semi-major and semi-minor axes, respectively, we have 

E x ,{t) = acos(u)t — <p 0 ) 

E y ,(t) = bsm(ut - (j> Q ) , (1.5) 

where the phase (f> 0 must be determined in such a way that a > 0 and a > \b\ . 
Comparing Eqs. (1.4) and (1.5) we obtain 

a cos 4> 0 = E X C cos 4> 1 + E 2 S cos (j> 2 
a sin <p 0 = E X C sin (f> 1 + E 2 S sin (j> 2 
b cos 4> 0 = ~E 1 S sin cf> 1 + E 2 C sin (j) 2 
b sin (j) 0 = E 1 S cos (j) x — E 2 C cos (j) 2 . 

From these equations we easily get 

a 2 + b 2 = El + E 2 2 
ab = —E 1 E 2 sin(0 1 — <p 2 ) 

a 2 — b 2 = ( El — E%) cos 2a + 2 E 1 E 2 cos((/) 1 — (j> 2 ) sin 2a 

(Ef — E%) sin 2ct = 2E 1 E 2 cos {cj) 1 — (j> 2 ) cos 2a . (1.6) 

Equations (1.6) show that the geometrical properties of the polarization ellipse 
depend on four bilinear combinations of the electric field components. Introducing 
the notations 



Pi = El + E 2 
P Q = El - E\ 

Pu = 2^ E 2 cos ((f> 1 — (f > 2 ) 

P v = 2E 1 E 2 sin {(j)-! - cj) 2 ) , 



(1.7) 




4 



CHAPTER 1 



the angle a can be found from the equation 

P Q sin 2a = P v cos 2 a 



added to the further constraint 



a 2 — b 2 = P n cos 2a + P T , sin 2a > 0 . 



If we choose (0,7r) as the interval of definition for a, we obtain, for Pq ^ 0 



1 



P, 



a = - arctan I -P- 



P, 



Q , 



(1.8a) 



where 



and for Pq = 0 



Q; n 



a = 



Pq 


> 0 


and 


Pu 


> o 


if Pq 


> 0 


and 


Pu 


< 0 


if Pq 


< 0 


5 






h if P u 


> 0 






I 71 " if p u 


<0 







(1.8b) 



(1.8c) 



Obviously, in the case Pq = P v = 0 the angle a is left undefined. 

From Eqs. (1.6) we can also find the values of the semiaxes of the polarization 
ellipse. Since 

(a + b) 2 = P I -Py, 

the values of a and b can be found as the solutions of the second degree equation 



1 



x ~ \/ P I ~ I\- x ~ o P v ~~ 0 ’ 



which gives 



a = 



b = 



V P i ~ P v + V P i + p v 



V p i Pv V p i + p v 



(1.9) 



Note that the four quantities defined in Eqs. (1.7) are not independent, being 
related by the expression 



P? = PqP Pu P Py ■ 



( 1 . 10 ) 
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1.2. Special Cases of the Polarization Ellipse 

For some special values of the parameters E 1: E 2 , and (cf> 1 — <j) 2 ), the polarization 
ellipse degenerates either into a segment or into a circle. The first case takes place 
when P v = 0, which means that either E 1 or E 2 vanish, or, alternatively, that 
= (f) 2 or (f> 1 = <f) 2 ± 7 t. From Eqs. (1.9) we get 

a= \J-Pj , 6 = 0 , 

while the angle a can still be deduced from Eqs. (1.8). In this case the monochro- 
matic wave is said to be linearly polarized, and its electric held oscillates in the 
constant plane containing the direction of propagation and the direction, in the 
normal plane, characterized by the angle a. 

The other case corresponds to Pq = P v = 0, which means that E 1 = E 2 and that 
(4> 1 — (f> 2 ) = ± 7t/ 2; in other words the electric oscillations have the same amplitude 
along the x and y axes and are in phase quadrature. For the fourth parameter we 
then obtain 

P V = ±Pi > 

and correspondingly 

a= v /^/ 2, 6 = Tv /^/2. 

In both cases the polarization ellipse degenerates into a circle. When P v = +Pj, 
the tip of the electric held vector rotates clockwise for an observer facing the ra- 
diation source, as apparent from Eqs. (1.5) being a > 0, 6 < 0. Conversely, when 
P v = —Pj the rotation is counterclockwise as seen by the same observer. In this 
book we will adopt the convention of referring to the first case as positive (or right- 
handed) circular polarization, and to the second case as negative (or left-handed) 
circular polarization. In the first case a snapshot of the electromagnetic wave shows 
that the end point of the electric held vector draws a helix that hts the thread of 
a usual, right-handed screw; in the second case we have a left-handed screw. 

Our convention, which is summarized in Fig. 1.3, agrees with those proposed in 
the classical textbooks on polarized light by Shurcliff (1962) and by Clarke and 
Grainger (1971). The same convention is also used, although with some few excep- 
tions, by optical astronomers working in the held of polarimetry. Many radioas- 
tronomers, on the other hand, use the opposite convention, so that the situation 
in this held is still rather confusing (see for instance Clarke, 1974). 

The sign of circular polarization is also connected with the helicity (or spin) of 
photons. This connection is discussed in some detail in the following of this book 
(see Sect. 4.4 and App. 3). 



1.3. Polarization Tensor 

In Sect. 1.1 we have described the electric vibration of a plane monochromatic 
wave by means of the real quantities E 1 , E 2 , (f> 1 , f 2 . Alternatively, the electric 
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Fig. 1.3. Conventions for defining the sign and handedness of circular polarization. The tip of the 
electric field vector rotates, as a function of time, in a fixed plane perpendicular to the direction 
of propagation; the observer, receiving the radiation into his/her eye, sees a clockwise (a) or 
counterclockwise (b) rotation. The corresponding snapshots are shown in the lower half of the 
figure. 



vibration can be described using complex numbers, with the usual convention that 
the physical quantity is represented by the real part of the complex number. The 
first of Eqs. (1.2) can then be rewritten in either of the two equivalent forms (i is 
the imaginary unit) 



E x {r,t) = Re 



_ \(kz—ujt) 

c x e 



(i.ii) 



where £ x = E 1 e 1 ^! , or 



E x {f, t) = Re 



U—kz+cjt) 

C l 6 



( 1 . 12 ) 



where £[ = E 1 e _Il ^i is the complex conjugate of £ 1 . Here and in the following we 
will use the further convention of describing oscillating quantities with a temporal 
exponential of the form e~ lult (rather than e lwt ), so that the expression for E x (r,t) 
in terms of complex notations is that of Eq. (1.11). With the same convention, we 
have for the electric oscillation along the y-axis 



E v {r,t) = Re 



£ 2 e 



\{kz— cut) 



(1.13) 



with £ 2 = E 2 e'^b . 

Using the complex quantities £ ± and £ 2 it is possible to introduce a 2 x 2 Hermitian 
matrix J, called the polarization tensor, defined by 

■),j - £!£, (iJ = 1 , 2 ), 

with 

J* = Jji . (1.14) 

The four parameters describing the geometrical properties of the polarization ellipse 
(Eqs. (1-7)) are related to the components of the polarization tensor by the simple 
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expressions 



P i = £* 1 £ 1 + £* 2 £ 2 

p _ p* f __ p* p 
r Q ~ °1 °1 °2 °2 

p u = £1 £ 2 + £; £- t 

P v = \{£{£ 2 -£* 2 £ 1 ) . (1.15) 

Note that a minus sign would appear in the expression for P v if Eq. (1.12) were 
used. 

After inversion of Eqs. (1.15) the polarization tensor can be written as 

j _ 1 / p i + p q P u ~ i Pv \ 

~2[pu + iP v P r -P Q ) ’ 

which can be cast into a more synthetic form. Defining the formal vector 



f p 0\ 


f Pl \ 


Pi u 


p q 


P 2 


p u 


\PJ 


\p v ' 



and introducing the 2x2 identity matrix <r 0 and the Pauli spin matrices oq, <r 2 , 
and <r 3 , we can write 1 

1 3 

J =2^ Pi<T - (L16) 

i=0 

where 



a 0 = 







^3 




(1.17) 



with 

(T o cr k= 6 jk (T o+' l ^^jki cr i {j,k,l = 1,2,3). (1.18) 

i 



In the last equation Sj k is the Kronecker symbol - equal to 1 for j = k and to 0 
for j ^ k - and e^ kl is the antisymmetric (or Levi-Civita) tensor - equal to +1 if 
( j,k,l ) is an even permutation of (1,2,3), to —1 if is an odd permutation 

of (1, 2, 3), and to 0 if two (or three) indices are equal. 

Multiplying Eq. (1.16) by a - and using the relation 



r Tx(a j < T i ) = 28ji (i,j = 0,...,3), (1.19) 



The representation that we have chosen here for the Pauli spin matrices differs from the 
standard one, initially introduced by Pauli (1927) and then followed in classical textbooks on 
Quantum Mechanics (Schiff, 1949; Messiah, 1961; etc.). The matrices introduced by Pauli (cr x , 
(Ty, ct z ) are connected with the ones defined here by the relations: cr x = cr 2 , cr y = cr 3 , cr z = cr l . 
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we can rewrite Eqs. (1.15) in the form 

Pj = Tr (<Tj J ) ■ (1.20) 

Finally, Eq. (1.10) can be expressed in the compact form 

clet J = 0 . (1.21) 

1.4. Quasi-monochromatic Wave 

The pure monochromatic wave considered in Sect. 1.1 is nothing but a mathemati- 
cal abstraction; in the physical world one always deals with radiation beams having 
a non-zero angular spread and a finite frequency bandwidth. Such beams can be 
described using wave packets of the form 

E(r, t) = Re J £{k') „(#) d 3 k' , 

where £ (k 1 ) is the complex electric field amplitude of the single wave propagating 
with wavenumber k' and angular frequency u/ = c\k'\, with 

£(k') • k' = 0 , 

and where n(k') is the number density of waves in the three-dimensional wavenum- 
ber space. 

We now suppose the wave packet to be confined into a small range of wavenum- 
bers Ak centered around k. This means that our packet has an angular spread 
in the solid angle A£l ss |Z\fc|/|fc| and a finite frequency bandwidth Auj ss c|Z\fc|. 
Writing 

k! = k + 6k , t J = ui + 5ui 

we obtain 

E(r,t) = Re j £{k + 5k) n(6k)d 3 (5k) . (1.22) 

Let us consider a surface element E perpendicular to the k direction, and a co- 
ordinate system ( xyz ) as shown in Fig. 1.4. At any point P of the surface E the 
components of the electric field vector are given by 

E x (P,t) = Re [f 1 (P,t)e“ ia;t 

Ey(P : t) = Re[£ 2 (P,t)e~ iMt ] , 



(1.23) 
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Fig. 1.4. Coordinate system for defining the various components of the electric vibration of a 
quasi-monochromatic wave. The vector k is directed along the z-axis. 



where £ x (P,t) and £ 2 (P,f) are the x and y components, respectively, of the vector 
resulting from the evaluation of the integral in Eq. (1.22). The third component 
E Z (P, t) is much smaller than the x and y components (a typical ratio E z /E x being 
of order Ail) and will be neglected in the following. 

If the linear dimensions L of the surface E are such that L\Ak\ <C 1, the quan- 
tities £ x (P, t) and £ 2 (P,i) are weakly varying functions of the point P. Moreover, 
they are slowly varying functions of time, having temporal variations on typical 
time-scales much larger than the period of the wave. As a result, the tip of the 
electric field vector at the point P describes a polarization ellipse whose character- 
istic parameters vary slowly with time; these parameters will also show a smooth 
variation from point to point on the surface E. 

Obviously, it is impossible to define an instantaneous polarization ellipse in this 
situation. It is however possible to define appropriate average quantities by gener- 
alizing Eqs. (1.15) in the form 



Pi = (£* 1 (P,t)£ 1 (P,t)) + (£ 2 *(P,i)£ 2 (P,t)> 
P Q = <£*(P,t)^(P,t))- <£*(P,t)£ 2 (P,f)> 
P v = (£* 1 (P,t)£ 2 (P,t)) + (^(P.i^P.t)) 



P V = i 



(£*(p,t)5 2 (p,t))-(£: 2 *(p,t)f 1 (p,t)> 



(1.24) 



where the symbol (• • •) means an average over a time interval much longer than 
the wave period and an average over the surface E. 

Similarly, one can generalize the definition of the polarization tensor by writing 



J ij = (£*(P,t)£ j (P, f)> , (1.25) 

so that Eqs. (1.16) and (1.20) are still valid. 

The important difference from the case of the monochromatic wave lies in the 
fact that Eqs. (1.10) and (1.21) no longer are valid. To prove this statement, let 
us consider the complex quantity A defined by 

A = £ 2 (P ,t) (^(P.i^P.t)) -^(P.t) (S;(P,t)S 2 (P,t)) . 
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For the average value of its square modulus we have, with easy transformations 

< |.4| 2 > = (£{ (P, t) £,(P, t)> [<£? (P, t) ^(P, t)){£* 2 (P, t) £ 2 (P, t)) 

~(£* 1 (P,t)£ 2 (P,t))(£* 2 (P,t)£ 1 (P,t)) . 

From this equation, excluding the trivial case ^(P, t) = 0, we conclude that, being 
(£f{P,t)£ 1 (P,t))>0 and <|_4.| 2 > > 0 , 

we must have 

<£ 1 *(P,t)£ 1 (P,t))<£ 2 *(P,t)£ 2 (P,t))- <£r(P,t)£ 2 (P,t)><£ 2 *(P,t)£ 1 (P,t)) > 0 , 
that is 

det J = Pf - P%- - Py> 0 • (1.26) 

The equal sign holds only in the case ^4 = 0, which implies 

g 2 (p ,*) (gr(p,t)g 2 (p ,*)) 

^(P ,t) (£;(?,£)£,(?,£)) ’ 

this means either that one of the two components is identically zero, or that the 
two components have, at any point P and any time t, the same amplitude ratio and 
phase difference. In both cases the quasi-monochromatic wave is said to be totally 
polarized. The opposite situation occurs when the two components £ 1 (P, t) and 
£ 2 (P,t) have the same average amplitude and random phase difference, so that 

(£*(P,t)£ 1 (P,t)) = (£* 2 (P,t)£ 2 (P,t)) 

(£*(P,t)£ 2 (P,t)) = {£* 2 {P ,t)£ 1 (P,t)) = 0 . 

We have in this case 

Pq = Pu = Pv = 0 i 

and the quasi-monochromatic wave is said to be totally unpolarized (if the frequency 
of the radiation beam is in the visible band of the spectrum, the beam is said to be 
composed of natural light). In intermediate cases the quasi-monochromatic wave 
is said to be partially polarized with a polarization degree given by 




with 0 < p < 1. 
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Fig. 1.5. For a partially polarized quasi- monochromatic wave, the tip of the electric field vector 
at a given point P describes, as a function of time, the trajectory shown in this graph. Note that 
the characteristic shape of the polarization ellipse has a slow variation with time. The electric 
field is given in arbitrary units. 



In Figs. 1.5 and 1.6 the trajectory of the tip of the electric field vector is plotted 
for two schematic situations corresponding to a partially polarized and a totally 
polarized quasi-monoclrromatic wave, respectively. 



1.5. Polarizers and Retarders 

In the previous sections we have described the polarization properties of a radiation 
beam using appropriate averages of the electric vibrations. Now we must give some 
operational definitions capable of relating these properties to actual measurements 
to be performed on the beam. For this purpose it is necessary to introduce the 
concept of ideal polarizing filters , a concept that will be used in the following to 
give an operational definition of the Stokes parameters. 

We define an ideal linear polarizer (sometimes called analyzer ) as a device that 
is totally transparent to the electric vibration along a given axis (called the trans- 
mission - or acceptance - axis of the polarizer) and totally opaque to the electric 
vibration along the axis perpendicular to the former. When such a device is in- 
troduced in the optical path of a radiation beam, the components of the electric 
vibration are modified inside the polarizer according to the equation 
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Fig. 1.6. Same as Fig. 1.5 for a totally polarized quasi-monochromatic wave. Although the size of 
the ellipse changes with time, its characteristic shape (i.e. the a angle defined in Fig. 1.2 and the 
ratio between the axes) remains fixed. 



where £ a and £ b are the components, at the entrance of the polarizer, along the 
transmission axis and the orthogonal axis, respectively; £' a and £ b are the same 
components at the exit of the polarizer, and if is a phase difference that is totally 
inessential in determining the polarization properties of the beam. 

The simplest device whose performances approach in practice those of an ideal 
polarizer (in the optical range of the electromagnetic spectrum) is the well-known 
Polaroid which has been developed commercially in various forms by the Polaroid 
Corporation, Cambridge, Massachusetts, U.S.A. This device is based on the prop- 
erty shown by some organic, needle-shaped microcrystals of absorbing light prefer- 
entially along the longest axis of the crystal. Roughly speaking, a Polaroid consists 
of a large number of such microcrystals oriented through a procedure of unidirec- 
tional stretching. Since the basic physical phenomenon involved in the operation 
of a Polaroid is dichroism (the property of absorbing light to different extents de- 
pending on the polarization of the incident beam), the Polaroid is often referred to 
as a dichroic polarizer. 

Linear polarizers of a different kind are the so-called birefringence polarizers 
which consist of one or more birefringent crystals (generally calcite) suitably pre- 
pared and arranged into a compact system. These polarizers operate according to 
the following scheme: the incident beam is divided in two orthogonally polarized 
components that propagate along different directions in the birefringent crystal; 
one of the two components is physically removed by absorption or lateral deflec- 
tion, so that a single, linearly polarized component is transmitted through the 
whole system. Examples of birefringence polarizers are those bearing the names of 
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Ahrens, Wollaston, Rochon, Glan-Foucault, and Nicol. More details on birefrin- 
gent polarizers, as well as on dichroic polarizers, can be found in Shurcliff (1962). 
In the same book different kinds of linear polarizers, like reflection polarizers and 
scattering polarizers, are also described. 

Turning now our attention to retarders, we define an ideal retarder as a device 
which divides the incident beam in two orthogonally polarized components, de- 
plrases one of them relative to the other without altering their amplitudes, and, 
finally, reunites the two components giving rise to a beam whose polarization char- 
acteristics are, in general, completely different from those of the incident beam. 
An ideal retarder (sometimes called compensator ) is characterized by the fact that 
the phase velocity of an electromagnetic wave propagating along the optical axis 
depends on the direction of the electric vibration. For an electric vibration parallel 
to the fast axis the retarder is characterized by the index of refraction n { , while 
for an electric vibration parallel to the slow axis the index of refraction is n s , with 
n s > n { . 

Let us consider a monochromatic wave that propagates along the optical axis of 
a retarder of given thickness l. The electric field carried by the wave is described 
in complex notations by the equations 



E f (z, t ) = Re 



n \{n f kz—ujt ) 

t f e 



E s (z,t ) = Re 




i(n s kz—cjt) 



(0 <z<l) 



where z is the coordinate along the optical axis of the retarder with the origin 
( z = 0) at the entrance, £ { and £ s are the complex amplitudes of the electric 
vibration along the fast and slow axes, respectively, and k is the wavenumber in 
vacuum. 

Comparing the electric vibrations at the exit of the retarder with those at the 
entrance, we have with obvious notations 




where f> is a phase that does not affect the polarization properties of the wave, 
while 6, the so-called retardance (or retardation), is given by 

S = (n s — n f ) kl = 27 r (n s — n f ) l / A , (1.28) 

A being the wavelength in vacuum. When the thickness of the retarder is such that 
8 = 7t/2, the retarder is called a quarter-wave plate, and, similarly, when 8 = 7r the 
retarder is called a half-wave plate. 

It is interesting to note that an ideal quarter-wave plate followed by an ideal linear 
polarizer whose transmission axis is directed at 45° from the fast axis in the coun- 
terclockwise direction (as seen by an observer facing the radiation source) behaves 
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fast axis 




plate polarizer 

Fig. 1.7. Practical realization of a filter transparent to positive circular polarization and opaque 
to negative circular polarization. 



like an ideal filter that is totally transparent to positive (or right-handed) circular 
polarization and totally opaque to negative (or left-handed) circular polarization. 1 

To prove this statement, let us consider a circularly polarized monochromatic 
wave propagating along the direction shown in Fig. 1.7. If the wave is totally 
circularly polarized in the positive direction, the components of the electric field 
vibration along the unit vectors e x and e y before entering the retarder are (in 
complex notations) 



E x = Re 




E y = Re 




(1.29) 



where A is a constant. At the exit of the retarder, we have from Eq. (1.27) (apart 
from an inessential phase factor) 



E 'x= E y = Re 



Ae 



(1.30) 



thus the component of the electric vibration along the transmission axis of the 
polarizer is given by 



E„ 



1 

71 



(E' x +E' y ) = V 2 Re 



A 



— ic ot 

e 



Since this component is simply transmitted through the linear polarizer, the elec- 
tric vibration at the exit is still described by Eq. (1.30). Therefore, although the 



Obviously, when the transmission axis is inclined at 45° in the opposite direction, the optical 
combination behaves in the opposite way with respect to circular polarization. 
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polarization has been transformed from circular to linear, the energy carried by 
the wave has not been changed by the system (the overbar means a time average) 

W x +W y = E^+E^=\A \ 2 . 

It follows that the device schematized in Fig. 1.7 is totally transparent to positive 
circular polarization. 

Applying the same argument to a monochromatic wave of negative circular po- 
larization (E in Eq. (1.29) must be changed in sign) we obtain that the wave is 
totally absorbed inside the polarizer. 

Most of the retarders that are used in practice are of the birefringent type; they 
often consist of a crystal, like quartz or calcite, cut parallel to the optical axis. More 
recently piezo-optical birefringence modulators have also been used. In these devices 
the birefringence is induced by compressing or distending a material (otherwise 
isotropic and uniform) along a given direction. According to Kemp (1969) a very 
modest force, that can even be attained by simple finger pressure, is enough to 
produce a quarter-wave plate by stressing a block of glass having the size of a 
match-box. 

Other devices that have been often used to produce retarders with variable or 
modulated retardance are the so-called Pockels cells. Here the birefringence is 
induced by application of high electric fields (of the order of several kV /cm) in ma- 
terials such as ammonium dihydrogen phosphate (ADP) or potassium dihyclrogen 
phosphate (KDP). 



1.6. Stokes Parameters 

A simple operational definition of the Stokes parameters can be given in terms of a 
set of ideal filters and an ideal detector capable of measuring, in absolute units, the 
electromagnetic energy falling on its acceptance area. For this definition - or, more 
precisely, for the definition of the two parameters connected with linear polariza- 
tion - it is also necessary to fix a particular direction in the plane perpendicular to 
the direction of propagation. Such direction will be referred to in the following as 
the reference direction or the reference axis. 

Given a quasi-monoclrromatic radiation beam having a small aperture dfi, and 
frequency contained in the interval (v, v + d v), we define the first Stokes parameter, 
I, as the energy measured by our ideal detector per unit time and per unit cross- 
sectional area. In formulae, if dW 0 is the energy measured by the detector in the 
time interval df over the surface dA oriented perpendicularly to the beam, we have 

dW 0 = I dS dt dO dv . 

This is just the usual definition of the radiation specific intensity, so that the first 
Stokes parameter is simply called the intensity of the beam. 

Let’s now insert in the beam path an ideal linear polarizer with its transmission 
axis directed in succession at 0°, 45°, 90°, and 135° from the reference axis, all the 
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angles being reckoned counterclockwise for an observer looking at the beam from 
the detector. If dW x , d W 2 , dll 3 , and dW 4 are the corresponding energies measured 
by the detector in the time interval dt over the surface clS", the second and third 
Stokes parameters, Q and U, are defined by 

dW 1 — dW 3 = QdS dt clO dv 

dll 2 — d W 4 = U dS dt dfi dp . 

Finally, for the definition of the fourth parameter we introduce two ideal filters, 
the first being totally opaque to negative (left-handed) circular polarization and 
totally transparent to positive (right-handed) circular polarization, and the second 
behaving in the opposite way (such filters can be realized as shown in the former 
section). If dW 5 and dW 6 are the energies measured by the detector (in the time 
interval dt over the surface dS") with the first and second filters - respectively - 
interposed, the definition of the fourth parameter, V , is the following 

dW 5 — dW 6 = V dS dt clfi dp . 

The symbols used in this book to represent the Stokes parameters (/, Q, U, V) 
were introduced for the first time by Walker (1954). In his original note, Stokes 
(1852) used the symbols ( A , B , C, D ), while Jones (1941) and Perrin (1942) used the 
notation (/, M, C, S). Walker’s notation seems nowadays to be preferred especially 
in the astroplrysical literature. 

The operational definitions given above are not universally adopted. In partic- 
ular, the opposite sign is used in the definition of V by those authors who name 
positive circular polarization the one that we have defined as negative and vice 
versa (see the discussion at the end of Sect. 1.2). Less frequently a sign inversion is 
found in the definition of U. Figure 1.8 summarizes the conventions that are used 
in this book. 

Now that the Stokes parameters have been defined, it is necessary to establish 
their connection with the description, given in the previous sections, of the polar- 
ization properties of an electromagnetic wave in terms of electric fields. Since the 
energy flux carried by an electromagnetic wave is given by its Poynting vector, the 
energy AW measured in the time interval At by an ideal detector having cross- 
sectional area AS is connected with the electric field components by the relation 

AW = ^{El + E 2 y ) AS At , 

where E x and E are the components of the electric field along two axes perpen- 
dicular to each other and to the direction of propagation, and where the brackets 
mean an average over the time interval At and over the surface AS of the detector. 
Using Eqs. (1.3), (1.11), and (1.13), we can also write 

AW = (El + El) AS At = (£{ £ x + £* £ 2 ) AS At . (1.31) 
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Fig. 1.8. Schematic representation of the definition of the Stokes parameters. The observer is 
supposed to face the radiation source. 

Referring to the general case of a quasi-monochromatic wave, we introduce a couple 
of orthogonal unit vectors e a and e b , with e a specifying the reference direction and 
e b oriented in such a way that e a , e b , and the direction of propagation form a right- 
handed coordinate system. 1 If £, (P, t) and £ 2 {P ,t) are the complex amplitudes 
of the electric field along the e a and e b directions, the energy measured by the 
detector in the various configurations specified above is given by (see Eqs. (1.23)): 
a) without any filter 





AW 0 = 



(£*(P, t) £,( P, t)> + (£ 2 *(P, t) f 2 (P, t)> 



— AS At 

8tt 



b) with a linear polarizer interposed, having its transmission axis at 0°, 45°, 90°, 
and 135°, respectively 



AW, = (£*(P ^^{P^—ASAt 

= \ (£*AP,t)£ 1 (P,t)) + (£Z(P,t)£ 2 (P,t)) 

+ (£Z(P,t)£ 2 (P,t)) + (^(P.tJf^P.t)) 
AW 3 = <£*(P, t) £ 2 (P, t))^AS At 



8tt 



■AS At 



In the following, e a will be referred to as the reference direction unit vector , while e b will 
be referred to as the associated unit vector. Note that the definition of the Stokes parameters 
remains the same if we replace e a by — e a and e b by — e b . 
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AW 4 = ^ (^(P,t)£ 1 (P,t)) + (^(P,t)£ 2 (P,t)) 

- (£*(P,t)£ a (P,t)) - (^(P.i^P.t)) ^ASAt; 

c) with a filter for circular polarization interposed, opaque to negative polarization 
(and transparent to positive), and vice versa 

AW S = \ (£UP,t)£i(P,t)) + (£*2(P,t)£ 2 (P,t)) 

+ i (£* (P, t) £ 2 (P, f)) - i (£ 2 (P, t) ^(P, t)) ^ASAt 

AW e = ^ (fr(P,t)f 1 (P,t)) + (f 2 *(P,t)f 2 (P,*)> 

- i <^(P, t) S 2 (P, t)) + i (£* (P, t) SAP, t)) At. 

From these expressions, taking the limit for infinitesimal values of AS and At, and 
bearing in mind Eqs. (1-24) and the operational definitions of the Stokes parame- 
ters, we obtain 

I = kPj = k[(£AP,t)SAP,t)) + <£ 2 *(P,f)£ 2 (P,t)> 

Q = kP Q = ^[(^(P.i^P.i)) - (£ 2 *(P,t)£ 2 (P,f)) 

U = kP v = k [(SUP, t) S 2 (P , t)) + (£*(P, t) SAP,t)) 

V = kP v = ki[(S* l {P,t)S 2 {P 1 t))^(S* 2 {P,t)S l {P,t))\ , (1.32) 

where k is a dimensional positive constant whose actual value is important only 
when absolute measurements are to be performed. 

It should be kept in mind that Eqs. (1.32) are valid only when the component S 1 
of the electric vibration refers to the reference axis defining the Stokes parameters, 
and when the conventions implicit in Eqs. (1.11) and (1.13) are used. Note also 
that Eqs. (1.32) give a deeper physical meaning to the quantities Pj, Pq, P v , P v 
introduced in former sections (see Eqs. (1.7), (1.15), and (1.24)). 



1.7. Measurements of the Stokes Parameters 

The Stokes parameters can be measured by several different techniques, and the 
various instruments that have been devised for this purpose cannot be classified in 
a unique scheme. Following Hauge (1976) it is however possible to describe a kind 
of prototype instrument that summarizes the essential characteristics of most of 
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Fig. 1.9. Schematic representation of a prototype instrument aimed at the measurement of the 
Stokes parameters. 



the devices that are used in practice, especially for the measurement of the Stokes 
parameters in the visible or near-infrared range of the electromagnetic spectrum. 

Referring to Fig. 1.9, if £ 1 and £ 2 are (i n complex notations) the electric vibrations 
along the unit vectors e a and e b (where e a is the reference direction) , we have, after 
the beam has crossed the retarder (inessential phase factors are omitted) 



£ { = £ x cos a + £ 2 sin a 

£ s = e Il5 (— £ x sin a + £ 2 cos a) , 



where £ { and £ s are the components of the electric vibrations along the fast and 
slow axes, respectively, and <5 is the retardance. The only component transmitted 
by the polarizer is the one parallel to the transmission axis. This component is 
given by 

£ p = cos (/3 — a) £ f + sin(/3 — a) £ s , 
so that for the signal D of the detector we obtain after some algebra 



D(a,(J,6)=k (£;£ p > 



1 

2 



I + (Q cos 2a + U sin 2a) cos 2(/3 — a) 

— ( Q sin 2a — U cos 2a) sin 2(/3 — a) cos S 
+ V sin 2(/3 — a) sin S . 



(1.33) 



A first method for measuring the Stokes parameters is to set the various devices 
at particular angles. If we are concerned with linear polarization only, the retarder 
can be removed (i5 = 0°), or, alternatively, its fast axis can be set parallel to the 
transmission axis of the polarizer (a = /3). In both cases the signal D' of the 
detector takes the simpler form 



D\P) 



I + Q cos 2 (3 + U sin 2/3 





20 



CHAPTER 1 



and the Stokes parameters I, Q , and U can be found by the combination of four 
measurements 



I = D'(0°) + D'(90°) 

Q = £>'( 0°) - D'(90°) 

U = £>'( 45°) - £>'(135°) , 



or, alternatively, with only three measurements 

2 



I = 



o-l 



V3 



L>'(0°) + £>'(60°) + £>'(120°) 
2£>'(0°) - £>'(60°) - £>'(120°) 
£>'(60°) - £>'(120°) 



For the measurement of V the retarder is essential. Setting a quarter-wave plate 
(6 = 90°) at a = 0° we get from Eq. (1.33) 



D"iP) 



1 

2 



I + Q cos 2/3 + V sin 2 (3 



and the Stokes parameters /, Q, and V are found by 



1 = D"(0°) +£>"(90°) 

Q = £>"(0°) - £>"(90°) 

V = D"( 45°) - £>"(135°) . 



This technique, however, has the disadvantage of implying a rotation of the 
exit polarizer, which is often a dangerous procedure especially if the polarization 
analysis is to be followed by a spectral analysis (a grating spectrometer, for instance, 
has a response that is rather sensitive to the polarization of the incident radiation). 
To avoid these problems one can employ, for the measurement of linear polarization, 
a half-wave plate (5 = 180°) and keep the polarizer fixed at (3 = 0°. The signal is 
now 



D"\a) = 



1 



I + Q cos 4 a+U sin 4a 



and the Stokes parameters /, Q, and U can be obtained by the relations 



I = £>'"( 0°) + £>'"(45°) 

Q = £>"'(0°) - £>'"(45°) 

U = D"\ 22.5°) - £>'"(67.5°) . 
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For circular polarization, keeping the polarizer fixed at (3 = 0°, one gets (again 
using a quarter-wave plate) 

I + Q cos 2 2a + U sin 2a cos 2a — V sin 2a , 

and the Stokes parameters /, Q, U, and V can be found by the combination of five 
measurements 



T\UII / \ 1 

D (“) = 2 



I = D"" (45°) +£>""(135°) 



Q = 

U = 



2 

2 



£""( 0°) - £""(45°) - £>""(135°) 
\d""( 22.5°) - £""(67.5°)] 



V" = £""(135°) - £""(45°) . 



A different method for measuring the Stokes parameters is based on rotating 
devices. Using a quarter- wave plate rotating with angular frequency ui and setting 
P = 0°, we obtain for the detector signal £(f) the expression 



m 




Q U 

+ — cos 4 ujt + — sin 4 tot — V sin 2 ut 



and the various Stokes parameters can be measured (for a stationary source) by 
extracting the Fourier components of the signal £(t). 

A third method is to use a compensator with modulated retardance (like a Pockels 
cell) while keeping the optical components at fixed orientations. To maximize 
the modulation effect it is convenient to set (/? — a) = ±45°, and two sets of 
measurements are necessary to get the four Stokes parameters. For instance, setting 
a = 0° and /3 = 45° we have 



£(*) = 



/ + U cos S(t) + V sin S(t) 



whereas setting a = —45°, /? = 0° we get 

1 



£(*) = 



/ + Q cos (5(f) + V sin (5(f) 



and in both cases three of the four Stokes parameters can be obtained by an analysis 
of the modulated signal. 

We want to remark that the measurement of Stokes parameters is an art in itself 
that can be pushed to a very high degree of sophistication. The few words that 
have been spent here on this subject are not meant to be complete, but just to give 
the reader an intuitive grasp on how this kind of measurements are performed in 
practice. 
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1.8. Stokes Parameters and Polarization Tensor 



In Sect. 1.6 we have found the relation between the Stokes parameters and the 
components of the electric vibration along two orthogonal unit vectors, e a and 
e b , with e a the reference direction unit vector and e b the associated unit vector. 
Equations (1.32) can be rewritten in terms of the polarization tensor defined in 
Sects. 1.3 and 1.4. From Eq. (1.25) we have 

I — k (J n + J 22 ) 

Q = k ( Jn — J 22 ) 

U = k J 21 ) 

V = ki (J 12 — </ 21 ) • (1-34) 

Introducing the so-called Stokes vector S i (i = 0, . . . , 3) via the equation 



f S o\ 

Si 




(y 

Q 






u 


\sj 




\vJ 



we have, in strict analogy with Eqs. (1-16) and (1.20) 



j = lys 

2k ^ • 
1=0 



3*3 



Sj = k Tr (cr j J) (j = 0, . . . , 3) , 



(1.35) 



where cr ■ are the matrices defined in Eqs. (1.17). 

The preceding relations between the Stokes vector and the polarization tensor 
involve the dimensional quantity k introduced in Eqs. (1.32). This is a consequence 
of the fact that the Stokes parameters have the dimensions of an intensity, while 
the polarization tensor has the dimensions of a squared electric field. To simplify 
the theoretical treatment of polarized radiation it is often convenient to leave out 
the constant k, by defining a ‘new’ polarization tensor I having the dimensions of 
intensity 

I = kJ , (1.36) 

where J is the tensor introduced in Sects. 1.3-1. 4 and k is the quantity appearing 
in Eqs. (1.32). In the following we will use the term ‘polarization tensor’, without 
distinction, for both J and I. Obviously, the tensor I obeys the relations (cf. 
Eqs. (1.14) and (1-26)) 

n 3 = hi ’ ( L37 ) 



and 



det I > 0 



(1.38) 
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In terms of 7, Eqs. (1.34) and (1.35) read, respectively 

1 = hl+ I 22 
Q = h 1 ~ 1 22 
U = I 12 + hi 
V = * (^12 ~ ^2l) 



(1.39) 



and 



' = )E s a 

j—0 

S j =Tr(* j l) (i = 0, . . . , 3) . (1.40) 

If the unit vectors chosen to define the polarization tensor differ from (e 0 , e b ), the 
relations between the Stokes parameters and the components I- are not as simple 
as in Eqs. (1.39). Let us consider, in particular, the (complex) unit vectors e + and 
e_ defined by 



e + = cos 0e a + sin 8 e* e b 

e_ = — sin# e a + cosOe'^ e b , (1.41) 



with 




Denoting by £ + and £_ the controvariant components of the electric vibration 
along the unit vectors e + and e_, defined in such a way that £ = £ 1 e a + £ 2 e b = 
£ + e + + £_e_, we have 



£ 1 = cos 8 £ + — sin 8 £_ 

£ 2 = e’^ (sin# £ + + cos 8 £_) , 



and simple relations can be established between the ‘old’ polarization tensor I i;j = 
k (£* £j) ( i,j = 1,2) and the ‘new’ polarization tensor I a/3 = k(£*£p) (a, (3 = 
+> _ ) 



hi 

I 22 

h2 



= cos 2 8 1 ++ 
= sin 2 8 1 ++ 

— T* — I 

— J 21 — e 



+ sin 2 8 1 — sin 8 cos 8 (I+-+I-+) 

+ cos 2 8 1 + sin 8 cos 8 (/+_+/_+) 

sin 8 cos 8 ( I ++ — I ) + cos 2 6 1 + _ — 



sin 2 # 
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and conversely 

7 ++ = cos 2 9 7 n + sin 2 0 I 22 + sin 9 cos 6 (e ^ I l2 + e’^ 7 21 ) 

I = sin 2 0 7 n + cos 2 9 1 22 — sin 9 cos 9 (e I 12 + e’^/ 21 ) 

I + _ = /* + = — sin 9 cos 9 (7 n — J 22 ) + cos 2 9e I ^’/ 12 — sin 2 6e^I 21 . 

Using these transformations we can express the Stokes parameters in the form 
7=7 ++ +7__ 

Q = cos 2 9 (J ++ — 7 ) — sin 29 (7 H + 7 

U = sin2#cos</> (7 ++ — 7 ) + cos20cos<^ (/_ ( + 7 f ) 

+ i sin 0 (/ + _ - 7_ + ) 

V = — sin 29 sin 0 (7 ++ — 7 ) — cos 29 sin <j> (/^ + 7 f ) 

+ i cos^ (7 + _ - 7_ + ) , (1-42) 

with the inverse relations 

7 ++ + 7„ = 7 

7 ++ I = cos 29 Q + sin 29 cos (f> U — sin 29 sin <fi V 

7 h +7 !_ = — sin 29 Q + cos 29 cos 4>U — cos 29 sin (f> V 

i(7_! —7 h ) = sin^ U + cos<j) V . (1-43) 

These formulae show that it is possible to find a particular couple of complex unit 
vectors of the form (1.41) such that the polarization tensor is diagonal. In fact, it 
can be easily shown that for (j> and 9 implicitly defined by 

tan (j) = -V/U , tan 29 = y/V 2 + U 2 / Q , (1.44) 

the components of the polarization tensor satisfy the relations 

7++ + 7 — = 7 

7 ++ - 7 = ±\/Q 2 + U 2 + V 2 

7 + _ = 7_ + =0, 

where the sign ambiguity in the right-hand side of the middle equation is connected 
with the determination chosen for the <j> and 9 angles in Eqs. (1-44). On this basis 
of complex unit vectors the electric vibration is decomposed in two independent 
orthogonal components. 
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Fig. 1.10. The rotation of the reference direction is specified by the angle a. The radiation beam 
is propagating from behind the page to the reader’s eye. 

1.9. Properties of the Stokes Parameters 

Since the Stokes parameters are defined with respect to a given reference direction, 
we must establish how they are changed when a different reference direction is 
chosen. Referring to Fig. 1.10, and denoting by (/, Q, U, V) and (/', Q ' , U\ V') the 
Stokes parameters relative to the old and new reference direction, respectively, the 
transformation law can be established with the help of the equations derived in the 
previous section. Substituting a for 6 and 0 for <j) into Eqs. (1.41), and interpreting 
the unit vectors e + and e_ as and e b , respectively, we obtain from Eqs. (1-43) 
and (1.39) 

I' = I 

Q' = cos 2a Q + sin 2a U 

U' = — sin 2 a Q + cos 2a U 

V' = V . (1.45) 

These transformations show that I and V are invariant under rotation of the refer- 
ence direction, while the linear polarization parameters undergo a rotation through 
an angle 2a in the Q -U plane, which leaves unchanged the total linear polariza- 
tion P h = \J Q 2 + U 2 . It is also apparent that the reference direction has a 180° 
ambiguity, an obvious consequence of the definition of the Stokes parameters. 

Introducing the position angle a 0 , defined by 

Q = P L cos 2a 0 

U = P h sin 2a 0 , (1.46) 

the former equations for Q' and U' become 

Q' = P L cos 2(a 0 — a) 

U' = P L sin 2(a 0 — a) . 
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The two quantities P L and a 0 are often used to characterize the linear polariza- 
tion in place of Q and U. In astronomical observations the reference direction is 
generally taken along the meridian through the observed object, and the position 
angle is measured counterclockwise from the same meridian. In solar observations 
the reference direction is sometimes chosen to represent a physical direction, like, 
for instance, the radial direction from the center of the sun to the observed point, 
or the parallel to the solar limb. 

Another fundamental property of the Stokes parameters is their additivity for 
incoherent beams. This property can be proved quite easily from Eqs. (1.32). 
Indeed, if the electric vibrations of two beams are described (in complex notations) 
by P) 1 ' (P, t) and £( 2> (P,t), respectively, and if the two beams are incoherent, so 
that 



(Op,*)* £f( p,*)> = (3 2> (p.*r = o (ij = 1 , 2 ) , 

we obtain for the Stokes parameters of the composite beam 

S k = S™ + S™ (k = 0, . . . ,3) , 

where S'). 1 ’ and S £ 2) are the Stokes parameters of the separate beams. This property, 
which can obviously be generalized to an arbitrary number of incoherent beams, 
will be referred to in the following as the addition theorem. 

As the Stokes parameters of a radiation beam satisfy the relation (see Eqs. (1.38) 
and (1.40)) 

I 2 > Q 2 + U 2 + V 2 , (1.47) 

we can write, using the addition theorem 



f 1 ) 




fi - VQ 2 + u 2 + v 2 \ 




/ V Q 2 + U 2 + V 2 \ 


Q 




0 




Q 


u 




0 




u 


\v) 




l o ) 




\ V / 



This means that any radiation beam can be considered as the incoherent super- 
position of an unpolarized beam and a totally polarized beam. The latter is char- 
acterized by a well-defined polarization ellipse, whose elements can be found from 
Eqs. (1.8) and (1.9). The former, on the contrary, is composed of natural radiation. 

Another interesting feature of the Stokes parameters stems from the possibility 
of establishing a mapping between the Stokes vectors and the points of a three- 
dimensional space that will be referred to as the Poincare space. To the Stokes vec- 
tor S = (/, Q, U, Vy we associate the point P having coordinates (Q/I, U/I, V/I), 
as shown in Fig. 1.11. When the point P lies on the surface of the sphere of unit 
radius (the Poincare sphere ), the corresponding Stokes vector represents a totally 
polarized radiation beam, while the center of the sphere represents a beam of natu- 
ral radiation. This mapping, proposed by Poincare (1892) in a classical monograph, 
is particularly suitable to visualize the effect produced on a light beam by a given 
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Fig. 1.11. Mapping of a Stokes vector (I,Q,U,V) into the representative point P inside the 
Poincare sphere of unit radius. 

device (like a polarizer or a retarder) as a stepwise movement of the representa- 
tive point on the sphere. In Chap. 5 we will see how it is possible to visualize 
the transfer of polarized radiation as a continuous movement of the representative 
point inside the Poincare sphere. 



1.10. Photons and Stokes Parameters 

As we have seen in the former sections, the operational definition of the Stokes 
parameters of a radiation beam involves at least four independent measurements 
performed by interposing polarizers or retarders in the beam path. On the other 
hand, according to the principles of Quantum Mechanics, a photon which interacts 
with a polarizer is either absorbed or transmitted; in the latter case its state is in 
general modified, the photon polarization being now parallel to the transmission 
axis of the polarizer. All information about the original polarization state of the 
photon is in any case lost, so that it is impossible to perform the four independent 
measurements needed to determine its Stokes parameters. We conclude that to 
speak about the Stokes parameters of a single photon has no physical meaning. 

Stokes parameters can only be measured for a statistical ensemble of photons. 
The connection between the measured Stokes parameters and the photon wave- 
function can be easily established. Referring to Fig. 1.7, let us suppose that the 
i-th photon has a wavefunction of the form 

l^ W > = 4° IV’i) + 4° IV>2> . 

where | ip 1 ') (|i/> 2 )) is the wavefunction in the polarization space of a photon which 
propagates along the z-axis and is transmitted through a polarizer having its trans- 
mission axis in the e x ( e y ) direction. Applying to the photon beam the same argu- 
ments developed for wave packets in Sect. 1.6, we can express the Stokes parameters 
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as 



I — k [(c-lC-l) + (c 2 c 2 )] 

Q = k' [(cfo) - (c 2 c 2 )] 

U = k' \{c\c 2 ) + ( C ; Cl )] 

V = k'x [{c\c 2 ) - {c* 2 cf)] , 

where k' is a dimensional constant and where the statistical averages (c£c ; -) are 
defined by 

( c fe c j ) = X] c fe ' ’ * c f (k,j = 1,2), 

v i=l 

iV being the number of photons falling on the measuring device in a given time 
interval At. 

Only when all the photons are in the same state, so that the amplitudes cp are 
independent of i, it is possible to determine their wavefunction from the measured 
Stokes parameters. 1 This case, characterized by the equality 

/ 2 = Q 2 + u 2 + V 2 , 

is the analogue of the macroscopic case of the monochromatic wave. On the con- 
trary, when the states of the single photons are different from each other, we have 
the more complicated situation described by Eq. (1.47), that has its macroscopic 
analogue in the quasi-monochromatic wave. From the quantum- mechanical point 
of view, the photons are said to be in a pure state in the former case and in a 
mixture of states in the latter. 



Apart from an inessential phase factor. According to the principles of Quantum Mechanics, 
absolute phases of wavefunctions are not observable quantities. 
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ANGULAR MOMENTUM AND RACAH ALGEBRA 



This chapter, and the two following, are devoted to establish the quantum- mechan- 
ical bases that will be needed to approach the theoretical problems involved in the 
generation and transfer of polarized radiation. In particular, we will recall in the 
present chapter some important concepts about angular momentum and Racalr 
algebra and will introduce a set of notations and definitions that will be useful in 
the following. 

These arguments have been treated in various classical textbooks or monographs 
(see e.g. Edmonds, 1957; Messiah, 1961; Brink and Satclrler, 1968; Varshaloviclr 
et al., 1988). The reason why they are repeated here is to spare the reader the 
painful job of translating notations and conventions from one book to the other. 
Obviously, the reader who is already familiar with the theory of angular momentum 
can simply skip this chapter. 



2.1. Eigenvalues and Eigenvectors of Angular Momentum 

Angular momentum is defined in Quantum Mechanics as a Hermitian vector op- 
erator, J, whose components along the axes ( xyz ) of a right-handed coordinate 
system satisfy the following commutation rules 1 

[• 4 ) Jy\ = l ^z 
[Jy’ Jz\ = 

[J z ,J x ]=U y . (2.1) 

From these equations it can be easily shown that the square of the angular mo- 
mentum, 

J 2 = J 2 + J 2 + J 2 , 

commutes with each of the three components 

[J*,J X ] = [J 2 ,Jy] = [J*,J X ]=0. 

Taking J 2 and one of the three components - say J z - as a maximum set of com- 
muting operators, we can look for the common eigenvectors of these two operators 
that we denote by the symbol |A to) 

J 2 |A?u) = A | Am) , J z |Am) = m |Am) . 

1 The operator J considered here is the angular momentum expressed in units of the reduced 
Planck constant h = h/(2n). 
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The Hermitian character of the operators J x , J y , and J z , together with the com- 
mutation rules (2.1), are sufficient to deduce the eigenvalue spectrum or, in other 
words, the allowed values for A and m. 

Let us introduce the operators J + and J_, usually called the shift operators, 
defined by 

J± = Jx^~ l 'Jy ■ 

Directly from the definition it is possible to prove the following relations 



(<4) f = 


(2.2a) 


[J 2 ,J ± ] = 0 


(2.2b) 


[J Z ,J ± ] = ±J ± 


(2.2c) 


£ 

ii 

to 


(2. 2d) 


J + J. = J 2 - Jl + J z 


(2.2e) 


J_J + = J 2 — J 2 - J z . 


(2.2f) 



Using these relations it can be proved that the vectors J ± | Am) are still eigenvectors 
of J 2 and J z , corresponding to the eigenvalues A and (m ± 1), respectively. From 
Eqs. (2.2b) and (2.2c) we have 

J 2 [J ± |Am>] = J ± J 2 |Am> = A [J ± |Am>] 

J z [J ± |Am>] = [J ± J Z ± J±\ | Am) = (m± 1) [J ± |Am>] . (2.3) 

On the other hand, if we consider the norms of the vectors J + |Am) and J_|Am) 
(that cannot be negative numbers), and take into account Eqs. (2.2a), (2.2e), 
and (2.2f), we have 

<(Am| J(j) J + |A to) = f\m\J_J + \\rny = (A — m 2 — m){Am|Am) > 0 

(Am| jl J_|Am) = <Am| J + J_|Am) = (A — m 2 + m)^Am|Am) > 0 , (2-4) 



so that 

A — m 2 — m > 0 , A — m 2 + m > 0 . 

Since A and m are real numbers (being eigenvalues of Hermitian operators), these 
two inequalities give 

A > 0 , |m| < A . (2.5) 

Therefore, once we fix a particular eigenvalue A for J 2 , the eigenvalues of J z must 
be confined within a limited interval. 

Starting now from a given eigenvector |Am) and applying repeatedly the shift 
operator J + (or J_, respectively), we obtain two distinct chains of eigenvectors 
corresponding to eigenvalues of J z increasing (or decreasing) by unity as shown 
in Eq. (2.3). These two chains have to stop at a certain point so that the second 
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inequality in Eq. (2.5) can be satisfied. Thus there must be a maximum eigenvalue 
for J z - say j - such that 

J + \Xj)=0. 

Equations (2.4) give for this eigenvector 

A - j 2 - j = 0 , (2.6) 



so that 

A = j(j + 1) • 

Similarly, there must be a minimum eigenvalue for J z - say j' - such that 

J- |A/>=0. 

From Eqs. (2.4) we obtain for this eigenvector 

A — j' 2 + j' = 0 , 

and comparing this equation with Eq. (2.6) we get the following relation between 
j and f 

j(j + 1) = j'{f - 1) • 

This is a second degree equation whose solutions are 

f f = ~j 

The second solution must be rejected as it leads to j' > j. We are then left with the 
first solution which implies that the minimum eigenvalue of J z is just the opposite 
of the maximum eigenvalue. 

On the other hand, we can obtain the eigenvector |A — j) from the eigenvector 
| A j) by applying to the latter a finite number of times the operator J _ ; this implies 
that the number [j — ( — j)] = 2 j is an integer, so that the only allowed values for j 
are integers or half-integers; moreover, the number of eigenvectors associated with 
a given eigenvalue of J 2 is (2j + 1). 

From now on, the eigenvectors of J 2 and J , will be denoted by the symbol |jm) : 
they are supposed to be normalized to unity and to satisfy the eigenvalue equations 

=j(j + 1) | jm) 

JJjm} =m\jm.} (m = -j, -j + 1, . . . , j) . (2.7) 

Note also that the eigenvectors | jin') obey the orthogonality relation 

= 5 jf S mm , , (2.8) 



since J 2 and J z are Hermitian operators. 
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The phase relations among the various eigenvectors characterized by the same 
j- value and different m - values must be established using suitable conventions. From 
the properties of the shift operators we have 

J+\jm} = a | j m+ 1> , 



where 

|a| 2 = j(j + 1) — m(m + 1) . 

The phase of the a constant is conventionally set equal to zero, so that 
J+\jm} = yjj{j + 1) - m(m + 1) | j m+ 1> 

= V(j + -m) I j m+ 1> , (2.9) 



and, similarly 

= yj j(j + 1) - m(m - 1) | j m-l> 

= Vti - m + l)(j + m) | j m- 1> . (2.10) 

With these phase convention, it is easily found from Eq. (2. 10) that any eigenvector 
\jrti) is related to the ‘parent’ eigenvector \jj) by the equation 



\jm > 



{j + m ) ! 

(2 j)! ( j ~ m)\ 



J J -~ m I jj) ■ 



( 2 . 11 ) 



2.2. Coupling of Two Angular Momenta: Vector-Coupling 
Coefficients and 3-j Symbols 

A fundamental problem encountered in Quantum Mechanics is the addition of the 
angular momenta relative to two independent particles, or to two different degrees 
of freedom (like orbital motion and spin) of the same particle. If J 1 and J 2 are the 
angular momenta of the two separate systems, with 

[ J iv J 2 j] = 0 ( i,j = x,y,z ), 

the angular momentum eigenvectors (normalized to unity) of the total system can 
be written as 

bii2 m i m 2> . 

each eigenvector being a dyadic product of the form \jim l ')\j 2 m 2 ) that satisfies 
the eigenvalue equations 

J i bii2 m i m 2> = ii(ii + 1) bii2 m i m 2> 

J \z\hJ2 m l m 2> = m l bli 2 TO l TO l> 

J 2 2 bii2 m i m 2> = J 2 U 2 + !) bib 2 TO i m 2> 

J 2 zbii 2 TO i TO 2 > = fn 2 \j 1 j 2 m 1 m 2 y . 
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On the other hand, the total system can also be described, from the point of 
view of its angular momentum properties, in terms of a different set of operators, 
namely J 2 , Jf, J 2 , J z , where 

•/ -/| .1 .) . 

It is easily seen that these four operators commute among each other; as a conse- 
quence, we can introduce a new set of normalized eigenvectors of the form | j 1 j 2 JM'} 
such that 



J i \Jih JM ) = k(k + 1 ) \kk JM > 
J 2 \kk JM '> = k(k ~b 1) \hh JM ) 
J 2 \j lJ2 JM) = J(J + 1) -/A/) 

J z \k32 JM ') = M \kj 2 JM} . 



Since the two different sets of eigenvectors span the same Hilbert space, they must 
be connected by a unitary similarity transformation of the form 

\j 1 j 2 JM}= Y \kk m i m 2y(kk m i m 2\kk JM ) 

m 1 m 2 

b’iJ2 m i m 2> = Y \kk JM y(kk JM \kk m i m 2'> ■ ( 2 - 12 ) 

JM 



The coefficients appearing in these expressions are called vector-coupling coeffi- 
cients, or Wigner coefficients, or Clebsh-Gordan coefficients. In shorthand nota- 
tions they are often denoted by the symbol (j 1 j 2 m 1 m 2 \JMy, as the repetition of 
the arguments j 1 and j 2 in the ket is unnecessary. 

To deduce the expression for these coefficients we must first of all establish suit- 
able phase conventions. The phase relations among the eigenvectors of the form 
\k k rr h rn 2 > are the same as those of the former section, so that we can write (see 
Eq. (2.11)) 



b’lJ 2 m l m 2 > = 



1 Ui+m i y.(j 2 +m 2 )\ 

( 2 Ji) ! ( 2 j 2 ) ! C?'i - m iV- O2 - m 2)' 



jO i~ m i 
J l- 



k- 2 \kkkk) » 



and the same convention of the former section applies also to the phase relations 
among the eigenvectors of the form |JM) corresponding to a fixed J-value. The 
remaining phase conventions will be established in the following. 

Let us consider the eigenvector of the form \j 1 j 2 m 1 m 2 ') corresponding to the 
maximum m- values (m 1 = j), m 2 = j 2 ). We have 



J z\kkk32> — Ik z + kz\ Ikkkky — (k + k^kkkky 

J 2 \jij 2 j 1 j 2 y = [J\ + + ^kzkz + J1+J2- + k-k+\ \kkkk> 

= tiitii + !) + J2U2 + !) + 2 j 1 j 2 ] bii 2 iii 2 > 

= (k+k)(k+k + !) \kkkky ■ 
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These equations show that the eigenvector | Ji J 2 J 1 J 2 ^ is a ^ so an eigenvector of the 
form | JM), with J = M = (j 1 + j 2 ). By convention we take the same phase for 
the two eigenvectors, so that 

b'i+j'2 ii +i 2 ) = bii 2 iii2> • 

The above value of J is the maximum of the possible J-values, as it can be argued 
from the fact that the maximum eigenvalue of J z is (j 1 + j 2 ). The eigenvectors 
with the same J-value and lower Af- values can be found by repeated application 
of the operator J_, so that (see Eq. (2.11)) 



b’i +k M > 



(ji+i 2 + M)! 



(2j'i + 2j 2 )! ( j 1 + j 2 — M)\ 



[Ji_ + J 2 -] ii+ia m tiikkk} • 



The eigenvectors of the form |JM) with J < (j 1 + j 2 ) can be found using a set 
of operators, O n , that will be called the supershift, operators, defined by 



r\ \ ' v ( -I \n—r jr jn—r jn jn 

\ J l+ J 2+ J l- J 2- • 

r — 0 

Let us consider the application of the operator O n to the vector bii 2 JiJ 2 )- Using 
Eqs. (2.9) and (2.10) we have, after some algebra 



° n bii 2 iii 2 > 



(n!)V(2j 1 )!(2j 2 )! 
(2ji - n)! (2 j 2 - n)! 






r — 0 



(2jj — n + r)l (2j 2 — r)! 
r! (n — r)! 



b'ii 2 3i~n+r j 2 ~r) ■ 



Applying to this vector the operator J z we have 

J z[°n\kkk32>] = til +32 ~ n )[°n\3l323l32>] - 



(2.13) 



(2.14) 



and applying the operator J 2 , after some heavy algebra that is left to the reader 
as an exercise, we have 



j 2 [°nb'ii 2 JiJ 2 >] = til + 32 - n )tii + 32 -n+1) [0„b'ii 2 iii 2 >] • (2-15) 

Equations (2.14) and (2.15) show that the vector O n \j 1 j. 2 j 1 j 2 ') is, apart from a 
normalization factor and a phase factor, a vector of the form |JJ) with J = 
tii + j 2 ~ n). Note that n cannot be larger than the smaller of the values 2j 1 and 
2 j 2 (otherwise O n \j 1 j 2 j 1 j 2 ') = 0), so that the allowed values for J must satisfy the 
triangular condition 



b'l ~k\ < J <3i+k ■ 
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The norm N n of the vector O n \j l j 2 j^j 2 ) can be easily obtained from Eq. (2.13) 

v = (n! ) 4 (2^)! (2 j 2 )! (2 j 1 -n + r)\ (2 j 2 - r)! 

" [( 2 Ji - «) ! ( 2 J 2 - «) ! ] 2 ho r! (n - r)! 

The summation above can be evaluated using some properties of the binomial 
coefficients (see App. 1 of Edmonds, 1957, for a formal proof) 

y- (2j 1 -n + r)!(2j 2 -r)! = (2 j 1 - n)! (2j 2 - n)! (2j\ + 2 j 2 - n + 1)! 
h Q r! (n — r)! n! (2^ + 2j 2 — 2n + 1)! 

and hence 

^ (n!) 3 (2j 1 )!(2j 2 )!(2j 1 +2j 2 -n+l)! 

" (2jr - n)! (2 j 2 - n)\ (2 + 2j 2 - 2n + 1)! ' 1 J 

From Eqs. (2.13) and (2.16) we can express any vector | JJ) (normalized to unity) 
as a function of the vectors of the form j] J 2 m i m 2 ) ■ Apart from a phase factor we 
have 



|JJ> =, 



C?i + J 2 — ^)- (2</ + 1)! 



(ii — J2 + ^)- (j’2 — ii + ^)- C?i + J2 + J + 1 )- 

V 2 ' f_iVi+i 2 -J-r / Oi ~ J 2 + J + r )- (2j2 ~ r )- 

h V r - Oi + i 2 — ^ — r )- 

x liii 2 J -h+ r i 2 - r > • 



(2.17) 



The phase convention that will be used in this book is that of assuming Eq. (2.17) 
(with no phase factor in front) valid for any value of J. This convention agrees with 
those given by Racah (1942), Edmonds (1957), Messiah (1961), Brink and Satclr- 
ler (1968), Varshalovich et al. (1988), and, together with the other conventions 
established previously, leads to vector-coupling coefficients that are all real. 

Now that the phase conventions have been fully established, we can turn to the 
evaluation of the vector-coupling coefficients. From Eq. (2.11) we have 



| JM> 



1 (. J + M)\ 
(2J)! ( J — M)\ 



J 



J-M 



1 jj> 



(2.18) 






[J\- + J 2 -] 



J-M 



ST' {J - Af)! T k jj-M-k 

h k\(J — M — k)\ 2 - 



Writing 
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and introducing Eq. (2.17) into (2.18) we obtain 



I JM> =\ 



1 U 1 +j 2 -J)\(J + M)\(J-M)\(2J + l) 

Ui — 32 + ^)- U 2 ~ 3i + JV- Ui + J 2 + J + !)• 

j_M ii+i 2 - J ■ . T . x, . x, 

V- / o^+.t-j-r Ul ~ J 2 + J + r )' ( 2 J2 ~ r )' 

^ ^ 1 ’ h\ r.7- w- Wri r-i. +i. - j- 

fc=0 r=0 



k! (J — M — k)\ r! (j\ + j 2 — J — r)! 

I Ui + j 2 — J + k — r)\(J — M — k + r)\ 

X V (Ji ~ 32 + J ~ k + r)! (2 j 2 -J + M + k-r)\ 

x bii 2 J~j 2 ~k+r j 2 -J+M+k-r > . 

From this expression the vector-coupling coefficients are easily calculated. Taking 
the scalar product with (j\j 2 rn -[ m 2 1 ■ the terms in the right-hand side are zero 
except those satisfying the conditions 

J — j 2 — k + r = m 1 
j 2 — J+M + k — r = m 2 . 

Adding these two equations we find 

M = m l + m 2 , 

and we can eliminate the sum over k by substituting 

k = J — j 2 — m ^ + r= J — j 2 — M + m 2 + r . 

Thus we obtain 
0'lJ2 TO l m 2l JM ) = $ m 1 +m 2 ,M 

x / (2 J + 1) (ji + j 2 - J ) ! ( J + M ) ! (■ J ~ M V- Ui - "h)i O 2 - m 2 ~ 

V Ui - h + J ) ! U2 ~ 3 i + J ) ! (ii + J 2 + J + !) ! Ui + m i)' U2 + m 2)' 



E (-D 



j 1 +j 2 - J ~ r 



U 1 - 32 + J + r ) ! (2 j 2 - r ) ! 



r=0 



{j - 32 - m l + r ) ! O 2 - m 2 - r ) ! Ul +32- J - r ) ! r ' 
This expression can be transformed into a more symmetrical one (Racah, 1942) 
O 1 J 2 m i m 2 1 JM ) = S 



(2 J + 1)0*1 + 32 — Oi — 3 2 + ( — 3 1 + 32 + ^)- 



(ii +32 + J + !)• 



X Ul + m l)'- Ul ~ ™>1 )! U2 + m 2 ) 1 U2 ~ m 2) ! ( J + M ) ! - M ) ! 

X (“ 1 ) t D ! (j’l + 32 - J - t) ! Ul - m l - t) ! C ?2 + m 2 - t)' 

t 

X (j - j 2 + m 1 + t)! ( J - - m 2 + 1)! ] 1 , 



(2.19) 
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Fig. 2.1. The addition of two vectors having fixed projections on a given axis leads, in classical 
physics, to a continuous distribution for the modulus of the resultant vector. 

where the sum over t runs over all the values leading to non-negative factorials. 

The vector-coupling coefficients have a very simple physical meaning. Given two 
independent systems of angular momenta j 1 and j 2 , suppose that the 2 -axis pro- 
jections of these momenta have been measured and found equal to m 1 and m 2 , 
respectively. The square modulus |< i 7 1 j 2 m 1 m 2 | JM)| 2 represents the probability 
of finding the values J and M, respectively, when measuring the angular momen- 
tum of the total system and its 2 -projection. The Kronecker symbol in Eq. (2.19) 
implies that the measured M-v alue must be equal to (m 1 + m 2 ), which is the same 
result found for classical (non quantum-mechanical) angular momenta. In the clas- 
sical case, however, the total angular momentum J has a continuous distribution of 
values that depend on the relative orientation of the vectors jj and j 2 ( see Fig. 2.1). 
The vector-coupling coefficients can be regarded as the quantum-mechanical ana- 
logue of this distribution; but the distribution is now discrete, which reflects the 
quantum nature of the angular momenta. 

The vector-coupling coefficients satisfy a large number of properties. Multiplying 
Eqs. (2.12) by 0'iJW , -^ 7 | an d 0'iJ2 m i m 2l) respectively, taking Eq. (2.8) into 
account, and recalling that the vector-coupling coefficients are real, one gets the 
orthogonality relations 

= s jj' s mm' (2.20a) 

m 1 m 2 

(jlj2 m l m 2\ JM >(H32 m l m 2\ JM ) = S mi m[ S m 2 m' 2 ■ (2.20b) 

JM 

The identity (j 1 j 2 m 1 m 2 \J + \JMy = OiJ 2 m i m 2 l^i- + ^ 2 - l-^O leads to the re- 
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cursion relation 

\J J(J + 1) — M(M + 1) 0’iJ2 m i m 2l^ M+iy = 

= + !) - m 1 (m 1 - 1) (jj 2 m 1 - 1 to 2 |JM> 

+ \/j2(j2 + !) - w 2 (m 2 - 1) <j \j 2 m 1 m 2 -l| JM) . (2.21) 

The other properties of the vector-coupling coefficients are more easily expressed 
in terms of the 3-j symbols, which are defined by 

^ (2.22) 

Nowadays the 3 -j coefficients are employed more often than the vector-coupling 
coefficients. We just summarize below a set of relations that will be needed later. 
Formal proofs can be easily obtained with the help of Eqs. (2.19), (2.20), and (2.22). 

a) Orthogonality relations 

5>+l>(: ‘ ()(: ‘ $)=*«*„ (2.23a) 

oi/3 x 7 x 7 

b 2 - [ )(: b P ;)(:■ i < 2m > 



b) The 3-j symbols are invariant under cyclic permutations of their columns and 
are multiplied by (— l) a+h+c under non-cyclic ones 

(: j ')=(? ; j <™> 

c) They are multiplied by (— l) a+b+c under sign inversion of the second row 



a b c \ 

a 0 7 / 



(- 1 ) 



a+6+c 



a 

—a 




(2.25) 



d) By assigning particular values to some of the arguments, compact analytical 
expressions can be easily obtained; we have for instance 



/a b 0\ 
a 0 0 J 




fa a 1 \ 
a —a — 1 1 J 



^ a ~ as ^7^TJ 



(-1) 

(- 1 ) 



1-a+a / a + a + 1 

y (2a + 2) (2a + 1) 

a-a / (a-a)(a + a+l) 

V 2a(a+ l)(2a+ 1) 



(2.26a) 

(2.26b) 

(2.26c) 
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a a 1 

a —a 0 



a a + 1 1 

a —a — 1 1 



= (- 1 )° 



= (-I)”' 



= (- 1 )° 



/a(a + 1)(2 a + 1) 

I (a + a. + 1 ) (a. + qi + 2) 

(2a+ l)(2a + 2)(2a + 3) 

I (a — a + l)(a + a + 1) 
V (a + l)(2a + l)(2a + 3) 



(2.26d) 



(2.26e) 



(2.26f) 



Analytical expressions for the 3 -/ coefficients with one of the three elements of the 
first row set at 3/2 or 2 can be found in Brink and Satchler (1968); analogous 
formulae with one /-value equal to 5/2 are given by Saito and Morita (1955), and 
with one /-value equal to 3 by Falkoff et al. (1952). These analytical expressions, 
together with the various numerical tables of 3-/ symbols (or vector-coupling coef- 
ficients) prepared by several authors (Alder, 1952; Simon, 1954; Simon et al., 1954; 
Rotenberg et al., 1959; Varshalovich et al., 1988) have nowadays lost some of their 
interest due to the fact that, with modern computers, the direct computation of 
3-/ symbols via Eqs. (2.19) and (2.22) can be easily performed. A sample Fortran 
code is given in App. 1. 



2.3. Coupling of Three Angular Momenta: Racah Coefficients 

and 6-j Symbols 

In the previous section we have shown how it is possible to couple two angular 
momenta J 3 and J 2 to get the resultant vector J. When three angular momenta 
are present, say J ± , J 2 , and J 3 , one can follow the same line of reasoning, by first 
adding two angular momenta and then adding the third one to the sum of the first 
two to obtain, as a final result, the total angular momentum J. 

In this procedure we are however faced with some ambiguity. In fact, we could 
start from the couple ( J/, J 2 ) to obtain J 12 = J 1 + J 2 , and then add J 12 to J 3 to 
obtain J. Alternatively, we could start from the couple (J 21 J 3 ) to obtain J 23 = 
J 2 + J 3 , and then add J 23 to J 3 to obtain again the resultant J. Finally, we could 
introduce the intermediate vector J 13 — J 1 + J 3 and then add J 13 to J 2 . 

These different coupling schemes are related to the fact that, given three com- 
muting angular momentum operators J 1 , J 2 , and J 3 , three different sets of six 
commuting operators can be considered, namely 



I) Jl, Jl J 3 2 , j 2 2 , j 2 , j z 

II) J 2 , J 2 , J 2 , j 2 3 , j 2 , j z 

III) J 2 , Jl Jl Jl, J 2 ,J Z , 



IV) Jl , J 2 , J 3 , Ji z , J 2z , J% z ■ 



besides the standard set 
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Expressing the eigenvectors of sets I, II, and III in terms of the eigenvectors of 
set IV, it is possible to find the transformations connecting the different bases. The 
coefficients entering these transformations are called the recoupling coefficients and 
can be derived as follows. 

Denoting the eigenvectors of set I by the symbol 



I(iii2)ii2>i3- JM > i 

we apply twice the first of Eqs. (2.12) to obtain 

I(ili2)il2>j3> JM > = bl2i3 rn 12 m 3/ > 0l2i3 TO 12 TO 3lil2i3'^O 

m 12 m 3 

\hj2te m \ m 2 m Z> 



(2.27) 



X Oli2 TO l TO 2lili2il2 rn 12> Ol2i3 TO 12 TO 3bl2i3 JM > > 



(2.28) 



where we have used the symbol 

blJ 2 i3 m l m 2 m 3> = bl TO l>b2 TO 2>b 3 TO 3 > 

to represent the eigenvectors of set IV. 

Similarly, for a given eigenvector of set II of the form bi> {kk)k^ J'M'y we 
obtain 

b'i> (i2i 3 )i2 3 > J m y = 

= \kkk m \ m 2 m ^y 

m x m 2 nn z m 2z 

x <i2i3 TO 2 TO 3b2i 3 i23 TO 23><ili23 TO l m 23bli23 J M ) • (2.29) 

If we now take the scalar product of the two vectors in Eqs. (2.28) and (2.29) we 
obtain, using the shorthand notation of the vector-coupling coefficients 

<(iii2)ii2>i 3 > JM\j v {j 2 k)k^ J m y = 



Ol k m l m 2 \jl2 m 12 y Ol2 j 3 TO 12 TO 3 1 ■ JM y 



X <i2i3 TO 2 TO 3 b23 TO 23> 0'lJ23 m l TO 2 3 l J ' M '> S JJ ' S MM> • ( 2 - 30 ) 



The scalar product in the left-hand side is independent of the M-value, as it can 
be easily proved by evaluation of the matrix element 



<(iii2)ii2>i 3 > JM\j_j + \h, {kjkkii JMy 
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Fig. 2. 2. When adding in classical physics three angular momenta J 3 to give the resultant J, 

once the sum of two of them and J 2 ) is specified, the modulus of (J 2 + 7 3 ) shows a continuous 
distribution. 



via Eqs. (2.9), (2.2a), and (2.2f). On the other hand, owing to the properties of the 
vector-coupling coefficients, the summation in the right-hand side can be restricted 
to m 1 , m 2 , and m 3 , with the condition (m, + m 2 + m 3 ) = M. 

The scalar product now derived is called the recoupling coefficient and has a 
simple physical meaning. Given three vectors j 1: j 2 , and j 3 , that combine to give 
the resultant J, the square modulus of the recoupling coefficient is the probability 
that, if by measuring (j) +J 2 )' 2 we found the value j 12 (j 12 -|-1), then, when measuring 
(j^-l-JJ,) 2 , we find the value j 23 (j 23 + 1 ) ■ As illustrated in Fig. 2.2, the modulus of the 
vector j 2 3 is undetermined in classical physics too; it has a continuous distribution 
of probabilities which originates from purely geometrical reasons. The quantum 
nature of the angular momenta adds to this ‘geometrical indetermination’ to give 
a discrete distribution for the probabilities. 

Dropping the inessential parameter M, the recoupling coefficients can be used to 
express the transformations between different bases in the form 

b'i. 02^3)^23’ JMy = y ' I(iij 2 )ji2.j3> JMy 

i 12 

x <(iii2)ii2>i3> (i2i3)i23> J > • (2-3i) 

In place of the recoupling coefficients it is customary to introduce different symbols 
like the Racah coefficients (or Racah functions ) defined by 

WijtfzJjs’, ji2j 2 3) = 

1 

\/(2ji2 + l)(2j 23 + 1) 



<(J1J2)J12) J3> J lil> (J 2 i3)i23> J > i (2.32) 
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or the more symmetrical 6-j symbols 



h J r = ( - 1)il+j2+J3+Jw( ^ 2 ^ 3; ^3)- 



(2.33) 



Starting from Eq. (2.30), and using Eqs. (2.22), (2.32), and (2.33), it is possible 
to express the 6-j symbol as a sum of products of four 3 -j symbols 



See'S* 



\d e f I = (- i) b + c + d +P+~i+s (2e + 1) 



c d e 

7 —8 e 



Qip-yScj) 

d b f 

8 ~/3 cj> 



b c a 

(3 —7 a 



f a e! 



(2.34) 



It should be remarked that, owing to the symmetry properties (2.24) and (2.25), 
the right-hand side can be written in several equivalent ways, obtained by column 
permutations or by sign change of the second row of each 3 -j symbol. Since for 
each 3 -j symbol there are six permutations of the columns and two determinations 
for the sign of the second row, 12 4 different realizations exist for the right-hand side 
of Eq. (2.34), apart from the ordering of the 3 -j coefficients. All these realizations 
have however the same ‘topological invariant’ which can be visualized by graphical 
methods. The theory of graphical methods for angular momentum problems can 
be found in Edmonds (1957) or in Brink and Satchler (1968). 

The 6-j symbol has several important properties. First of all, it is zero unless four 
triangular conditions are satisfied. These conditions can be illustrated as follows 




Another remarkable property is the invariance of the 6-j symbol both under inter- 
change of any two columns and under interchange of the upper and lower arguments 
in any two columns. 

An analytical expression for the 6-j symbol can be obtained from Eq. (2.30) by 
substituting the series expansion (2.19) of the vector-coupling coefficients. After a 
long calculation, Racalr (1942) gives the following expression 

{d e r} = A ( a6c ) A («e/) A(d6/) A(dec) ^(-l ) z (2 + l)! 

x [(2 — a — b — c)! (z — a — e — /)! (z — d — b — /)! (z — d — e — c)\ 

x(a + b + d + e — z)\(b + c + e + f — z)\(a + c+ d + f — z)\] 1 , (2.35) 



where 



A (abc) = 



' (a + b — c)\ (a — b + c)! (—a + b + c)l 
(a + b + c+ 1)! 
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and where the sum is extended to all the values of 2 leading to non-negative facto- 
rials. 

By specifying the value of one of the arguments, simple analytical formulae can 
be obtained, e.g.: 



a b 
d e 



= t ab 6 ed (- l)° +e+/ 



y ( 2 a + l)( 2 c? + 1 ) 



(2.36a) 



/ a a "b 2 2 

\ b + \ b c 



('_l') a + b + c +i I / ( a + b + c + 2 ) (a + 6 - c +~ 1 )~ 
1 ’ 2 y (a+l)(2a+l)(6+l)(26+l) 



(2.36b) 



f a a+i 5 1 _ s.a+ 6 +c 1 I (a — b + c + l)(c — a + b) 

5 " cj 1 J 2y (a +l)( 2 o+ 1)6(26+1) 



(2.36c) 



Ja a 1 1 ^ 1 ^Q + b + c + i a(a + 1) + b(b + 1) — c(c + 1) 

\ b b c j 2 ^/ a (a + l)(2a + 1)6(6 + l)(2h + 1) 



(2.36d) 



j ^ U-t-1 l) / .. \a+6+C 

\b + l b 

1 / (ci -|- 6 -|- c -|- 3)(ci -|- 6 -|- c -|- 2)(ci -|- b — c - b 2)(ct H - 6 — c -|- 1) 
X 2y (a + l)(2a + l)(2a + 3 )(b+ 1)(2 b + 1)(2 b + 3) 



(2.36e) 



(a a + 1 1 

\b b c 



(- 1 ) 



a+b+c+1 



1 I (a + b + c + 2) (a — b + c + l)(a + b — c + l)(c — a + b) 
X 2y (a + l)(2a + l)(2a + 3)6(6 + 1)(26 + 1) 



(2.36f) 



J u a -(- 1 l) , ^ , d | c 

\b-l b cj 1 j 

1 I (c— a + 6)(c — a + 6 — l)(a — 6 + c + 2) (a — 6 + c + 1) 
X 2 y (a + l)(2a + l)(2a + 3) (26 - 1)6(26 + 1) 



(2.36g) 



\ CL CL Z I / i \ a _|_5_|_ c 

\ 6 b c J ~ 1 J 

3 s(s + 1) — + 1)6(6 + 1) 

X 2 

2 y/ (2a - l)a(a + l)(2a + l)(2a + 3) (26 - 1)6(6 + 1)(26 + 1)(26 + 3) 

where 

s = c(c + 1) — a(a + 1) — b(b + 1) . 



(2.361r) 
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Further analytical formulae have been given by Biedenharn et al. (1952), while 
numerical tables have been prepared by Biedenharn (1952), Sharp et al. (1954), 
Simon et al. (1954), Rotenberg et al. (1959), and Varslralovich et al. (1988). As 
noted in the previous section, these tabulations have nowadays lost much of their 
interest since modern computers provide the possibility of computing 6 -j symbols 
with simple numerical codes like that in App. 1. 

Other important properties of the 6 -j symbols, that can be proved either from 
the analytical expression (2.35) or directly from their definition, are given below. 



Sum rules: 



£<-!)-<*+ 1){: ‘ ;}=i 



^(_l)a +h+ c ( 2 c+ l)|« b «| =(5/0V / (2a+1)(26+1) 



E(2c + D(2/ + i){» j )}{“ ; 

2 (-!)«»+“ (2 c +!){“ * ;}{: 5 



a b c 1 J a b c 1 _ J a e / 
d e /Me d g ) \b d g 



(2.37) 



(2.38) 



(2.39) 



(2.40) 



E (_\ \a+b+c+d+e+f+g+h+i+j ° b clja b C \ T <7 h C 1 

1 \d e f j \g h i j \e d j j 



f * 3 \ if i 3 
g d b J \h e a 



(2.41) 



Contraction of 3-j symbols: 



^ ) (_-|^fc+c+d+/3+7+<5 ( c d, e 
Pis S e 

= ( f a e\ f / a e) 

V 6 a £ I 1 c d b f 



d b f \ ( b c a 

S —(3 <f> J l (3 —7 a 



(2.42) 



£(-D 



a-\-b-\-c-\-d—e-\-f—ot—8 



( 2 / + 1 ) 



a b e \ ( d c e 

a (3 — e y \ A 7 e 



a b e 
d c f 



c a f 



b d f 

(3 5 — (f> 



(2.43) 
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2.4. Coupling of Four Angular Momenta: 9-j Symbols 



When four angular momenta are present, say J 1? J 2 , J 3 , and J 4 , different coupling 
schemes can be followed, as in the case of three angular momenta discussed in the 
previous section. For instance, one can start from the couples (J 1 , J 2 ) and (J 3 , J 4 ) 
to obtain the intermediate vectors J 12 = J x + J 2 and J 34 = J 3 + J 4 , and then add 
J 12 to J 34 to obtain the resultant J. Alternatively, one can start from the couples 
(./-, , J 3 ) and (J 2 , J 4 ) to get the intermediate vectors J 13 = J 1 + J 3 and J 24 = J 2 + J 4 , 
and then add J 13 to J 24 to obtain again the resultant J. A third possibility would 
be to introduce the intermediate vectors J 14 = J 4 + J 4 and J 23 = J 2 + J 3 . 

These different coupling schemes are related to the fact that, given four indepen- 
dent angular momenta, three different sets of eight commuting operators can be 
constructed, namely 



I) 


J 2 

J 1 5 


J 2 

°2 j 


l 2 

J 3 > 


/ 2 


/ 2 

J 12! 


r 2 

^34! 


J 2 , 


•0 


II) 


J 2 

J 1 5 


l 2 

°2 1 


/ 2 


7 2 


T 2 

J 13! 


7 2 

^24! 


J 2 , 


•0 


III) 


J 2 

J 1 5 


J 2 
°2 ? 


J 2 

J 3 ’ 


7 2 

^4 ? 


7 2 

J 14! 


7 2 

^23! 


J 2 , 


J Z ! 


besides the standard set 


















IV) 


J 2 

J 1 5 


l 2 
°2 ? 


/ 2 
^3 5 


7 2 

^4 5 


A z ! 


^2 zi 


•Oz 


! ^Az 



Denoting the eigenvectors of set I by the symbol 

I(iii 2 )ii 2 > 03^4)11345 JMy 
and those of set II by the symbol 



I0ii3)ii3> (323a)324i 1 

the two bases are connected by the transformation 

I 0 ii 2 )ii 2 - = 1 0 ii 3 )ii 3 5 02114)1124! JMy 

3 13 ^24 

x <0*1^3)^135 02114)1124! ^| 0 ii 2 )ii 2 ! 03114)1134! JMy . 

The recoupling coefficients defined in this equation are independent of M (see the 
comments about Eq. (2.30)). In their place different symbols are usually intro- 
duced, like the 9-j symbols defined by 



<(11*1113)^135 02114)^24! 0 iH 2 )ii 25 (113114)^34! jy — 



— yf (2ji2 + l)(2j*34 + 1)(2 Ji 3 + l)(2j 24 + 1) 



k 

32 
3 12 



Js 



3a 
3 m 



3 13 

324 

J 



(2.44) 
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The 9 -j symbol can be easily expressed in terms of 3 -j coefficients by means of a 
procedure similar to that of the previous section for the 6 -j coefficients (Eqs. (2.27)- 
(2.30)). The final result is the compact formula 




fa d g\ f b e h\ f c f i\ 

\a S x) \P e V ) \7 <t> f. ) ’ 



where the sum runs over all the Greek symbols except a, namely over /?, 7, 6, e, 

<t>, x , v, £• 

The 9 -j symbol can be also expressed in terms of 6 -j symbols. We start from 
the vector \{ad)g, (be)h, z) and neglect the fact that the vector h is obtained by 
combining b with e. From Eq. (2.31) we have 

I {ad)g, (. be)h,i > = E fa, (dh)k, i\(ad)g, h, 1 ) |a, (dh)k, z) . (2-45) 

k 

Recalling that h is the resultant of b and e, we recouple the three angular momenta 
d , 6, e to obtain 

| a, ( dh)k , 1 ) = (b, ( de)f , k\d, ( be)h , fc) | a, ( bf)k , z) . (2-46) 

/ 

Finally we recouple the three vectors a, 6, / to get 

I a, ( bf)k , *> = E <( a& ) c > /> *l a > 0 l( afe ) c ) /, *> • (2-47) 



Substituting Eq. (2.47) into Eq. (2.46) and then into Eq. (2.45), and recalling that 
the vector / is the combination of d and e, we obtain 

I (ad)g, (be)h, z> = E <(a, (■ dh)k , z| (ad)g, h, z) <(5, ( de)f , k\d , ( be)h , fc) 

kfc 

x <(a&)c, /, i|a, (6/)fc, i)|(ab)c, ( de)f , z> . 

The three recoupling coefficients can be expressed in terms of 6-j symbols. For this 
purpose it is necessary to recall the definitions (2.32)-(2.33) and to observe that, 
owing to the properties of the vector-coupling coefficients, the order of the various 
angular momenta can be changed according to the following rules 



\a, (bc)d, e) = (— l) d b c |a, (cb)d, e) 

| a, ( bc)d , e) = (— l) e_a_d |(6c)d, a, e) . 
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Using these properties and the definition of the 9 -j symbol given in Eq. (2.44), we 
obtain the remarkable expression 



( 



a 

d 

9 



b 

e 

h 



c 

f 

i 



]T(-l) 2fc (2fc + l) 



a i 
h d 




f 

h 





(2.48) 



The 9 -j symbol satisfies several important properties. First of all, the symbol is 
zero unless the six triangular conditions illustrated below are satisfied 



0 — 0 — 0 



0 — 0 — 0 



! o o 

1 1 

o o 

1 1 

o o 



0 

1 

0 

1 



Moreover, it is invariant under reflection about either diagonal, and is multiplied 
by (— l) 4 , t being the sum of its nine arguments, under interchange of two adjacent 
rows or columns. 

If one of the arguments is zero, the 9 -j symbol reduces to a 6 -j symbol times a 
constant. This can be proved directly from Eq. (2.48), which gives 



a b c 
d e f 
g h o 



6c f 6 gh (-l) 



b+c+d+g 



\J (2c + 1)(2 g + 1) 



b 

d 



(2.49) 



For the numerical evaluation of 9 -j symbols Eq. (2.48) can be conveniently used, 
and numerical tables are also available (Smith and Stevenson, 1957; Smith, 1958; 
Varshalovich et al., 1988). However, as already mentioned about 3 -j and 6 -j sym- 
bols, simple computer codes like that in App. 1 are nowadays generally preferred. 
Other important properties of the 9 -j symbols are given below. 

Orthogonality: 

( a 

£(2c + 1)(2/ + l)(2c/ + 1)(2 h + 1) < d 
cf { g 



Sum rule: 

{ a b c] ( a d (a b 

d e / e b h\=le d (2.51) 

g h i \ I j k i J I j k i I 



e 

h 




= <W<W (2-50) 
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Contraction of 3-j symbols: 



( 



a 

a 



b c 

P 7 



a b 

d e f \ = 
9 hi j 



( b e h\ ( c f i\ ( a d g\ ( d e f \ ( 9 h i\ 

yf. \P e v)\x 4> Cy\ a 5 x) V 5 6 PJ\x v 




( c f i\ f a d g\ f d e f\ ( g h i\ 
W P f) [a S x)\S e 0 J \ X V ZJ 



E^ + mc + r (: ' * 

_ (a d g\ f d e f\ f g h i\ 

\ a S x) \d e (t>) \X V 



a b 

d e f\ = 
g h i ) 



(2.52) 



(2.53) 



(2.54) 



Contraction of 6-j and 9-j symbols: 



(2c + 1) 



a b c 
d e f 
9 h i 



a b c 

f i k 




i k 
d g 



b f k 1 
d h e J 



(2.55) 



(-1 y+c+d+f-b-g-h-l ( 2s + ^ 2t + ^ 



X 



a 

d 

9 

a 

d 

9 



b 

t 

l 

b 

e 

h 



c 

s 

k 



c f 
j k 



c 

f 

i 



9 h if 
J k If 



d 

j 



f 

t 



b 

j 




(2.56) 



Finally, we want to remark that the methods illustrated so far for the coupling 
of 2, 3, and 4 angular momentum operators can be directly generalized to any 
number of operators. The addition of n angular momenta will involve recoupling 
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coefficients depending on 3 (n— 1) parameters. For n = 5 two different types of 12 - j 
symbols have been introduced (Jahn and Hope, 1954; Ord-Smith, 1954; Elliott and 
Flowers, 1955). Fortunately we will not need such symbols in this book. 



2.5. Rotations and Euler Angles 

It is well-known that in Quantum Mechanics the state of a physical system is 
described by a vector in the Hilbert space, while the dynamical variables of the 
system are described by linear Hermitian operators. According to the postulates 
of Quantum Mechanics, the result of any given measurement accomplished on the 
system is expressed in terms of probabilities, which are calculated by taking the 
square modulus of scalar products between appropriate vectors of the Hilbert space. 
Similarly, the mean value of any given observable (its expectation value) is given by 
the diagonal matrix element where | -0) is the state vector of the system 

and O is the Hermitian operator associated with the measured observable. 

When a rotation is performed in the ordinary three-dimensional space, two differ- 
ent points of view can be followed: either a passive point of view which consists in 
rotating the observer’s coordinate system leaving the physical system unchanged, 
or an active point of view which consists in rotating the physical system leaving 
the coordinate system unchanged. 

Adopting the first point of view, the state vector |^) of the physical system 
remains unchanged, while the Hermitian operator O corresponding to a classical 
observable changes into a different operator, O', connected with O by the same 
transformation that holds for the corresponding classical observable (Correspon- 
dence Principle). This can be performed by introducing a unitary similarity trans- 
formation on the operators, 



O' = D(R)OD\R) , (2.57) 

where D(R) is an appropriate linear operator such that D'(R) = D~ 1 (R) and 
D(R~ 1 ) = D~ 1 (R ), where R~ l is the inverse rotation of R. 

On the contrary, adopting the active point of view, the operators remain un- 
changed, while the state vector | ipy changes into the new state vector |r//) given 
by 

IVO = D(R)\ipy . 

Obviously these transformations satisfy the property 

W IVO = {^\o\ipy , 

which means that if the same rotation is performed both on the coordinate system 
and on the physical system, the expectation value of any observable remains the 
same. 

Note that the formalism here introduced is consistent with the intuitive fact 
that an arbitrary rotation R performed on the coordinate system is equivalent to 
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Fig. 2. 3. To find the variation 8v of the components of the vector v under a positive rotation of 
the coordinate system through an angle <f> about the u-axis, we observe that this variation is the 
same as that obtained by rotating the vector v through an angle —(f) about the same axis. The 
figure shows that for an infinitesimal rotation of the arbitrary vector w through an infinitesimal 
angle a we have Sw = a u x w, so that 8v = — 4>u x v. 

the inverse rotation R _1 performed on the physical system. This can be formally 
proved by considering the quantity 

On one hand, this expression can be regarded as 

<$\[D{R)OD\R)]wy = wmy , 

which represents the expectation value of the operator O after a rotation of the 
coordinate system (passive point of view). On the other hand, the same expression 
can be considered as 

' W\D(R)]o[D\R)\<py] = , 

which represents the expectation value of the same operator after the inverse rota- 
tion of the physical system (active point of view) . 

The expression for the operator D{R ) can be easily found in the following way. 
We consider a physical system referred to a given coordinate system, and fix our 
attention on a particular observable like the vector v (representing for instance the 
position or the velocity or the angular momentum of a particle, etc.). We now 
perform a positive rotation of the coordinate system through an infinitesimal angle 
(f) about the unit vector u, where by positive rotation we mean, here and in the 
following, the one that makes a right-handed screw advance along the direction of 
il. The vector v will be changed by the quantity (see Fig. 2.3) 

Sv = — 4> u x v . 



(2.58) 
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According to Eq. (2.57) the operator D u (cf > ) corresponding to this infinitesimal 
rotation must therefore satisfy the relation 

v' = v + Sv = D u {<j>) vDl(<j>) , (2.59) 

with Sv given by Eq. (2.58). 

Let’s now recall that the commutation relations between any dynamical variable 
of vectorial character and the total angular momentum J are of the form 1 

[4> v i) = i e kim v m (*» l,m = x, y, z) . (2.60) 



From Eqs. (2.59) and (2.60) it can be easily proved that 

D u (<j>) = l-id>J u , (2.61) 

where J u = J-u is the projection of the total angular momentum on the 'u-direction 
(note that Dl(</>) = D~ 1 {4>) = D u (-<j>)). 

From Eq. (2.61) we can find the expression for the operator corresponding to the 
rotation through a finite angle O. To this aim, we divide the angle O into a large 
number n of equal parts, and perform the finite rotation as the succession of n 
infinitesimal rotations of amplitude Q/n. We have 



D u(ty = lim 



1 - !— J u 

n 



= lim > 

n. — ton • ^ 



n\ (-i nj u ) r 



= £ ( ~ in . J " ) lim 



oo ^ r\ (n — r)! n r 

r—0 v J 



n\ 



r—0 



! n — >oo (n — r)\ i 



and since the limit gives 1, we obtain 



D,( Si) = £ = e- 10 -'* , (2.62) 

r=0 

where, by definition, the exponential of an operator has the usual meaning given 
by its Taylor expansion. 

Having established the expression for the operator D corresponding to a finite 
rotation about an arbitrary axis, we can easily find its expression for the most 

1 For a system consisting of one particle, if r, p, l = j- r x p, s, and J = l + s are the 

position, momentum, orbital angular momentum, spin, and total angular momentum operators, 
respectively, Eq. (2.60), with v denoting any of these vectors as well as any linear combination of 
them, can be easily deduced from the fundamental commutation rules 

h> r j] = h> Pj\ = o - KPj] = m <Sy , [Sj, sj = i e ijk s k ■ 

The generalization to the case of many-particle systems is straightforward. 
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Z 




Fig. 2. 4. Definition of the Euler angles. 



general rotation of the reference system. But we must first introduce the Euler 
angles, which are defined as follows. 

Referring to Fig. 2.4, we first perform a rotation through an angle a (0 < a < 27 t) 
about the 2 -axis of the original reference system. This rotation brings the y- axis 
into a new axis, called the line of nodes, or ui-axis. Next we perform a rotation 
through an angle f3 (0 < /? < if) about the line of nodes. This rotation brings 
the 2 -axis of the original system into a new axis, called the figure-axis, or 2 / -axis. 
Finally we perform a rotation through an angle 7 (0 < 7 < 27 t) about the figure-axis 
to get the new system ( x'y'z '). 

For the general rotation specified by the three Euler angles (a, /3, 7 ) we have 

D{a,p, 1 )=D z ,{ 1 )D w {p)D z {a) , (2.63) 

an expression which has the disadvantage that the rotations (3 and 7 are relative to 
axes that have been carried over by previous rotations. To overcome this drawback 
we observe that 1 

D w ((3) = D z (a) D y {(3) D z (-a) (2.64) 

Equation (2.64) is self-evident from geometrical considerations, since 
D W (P) D z (a) = D z (a) D y {(3) ; 

however, it can also be proved analytically. From Eq. (2.62) we have 

£>Jd)=e- i/3J » , 
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and that 

DAl) = DJP) D z (rt) D w (-(3) . (2.65) 

Substituting Eqs. (2.65) and (2.64) into Eq. (2.63), and noting that two rotations 
about the same axis commute, we obtain 

D(a,0, 7 ) = D z (a) D y (0) D 2 ( 7 ) = <T iaJ • e~ iPJ * e~' nJ * . (2.66) 

This expression shows that the rotation described by Eq. (2.63) can also be achieved 
by performing three successive rotations about the axes of the old reference system 
in the following order: first a rotation through an angle 7 about the 2 -axis, then 
a rotation through an angle (3 about the y- axis, and finally a rotation through an 
angle a about the 2 -axis. The former realization of the rotation, involving the 
line of nodes and the figure-axis, is easier to visualize; the latter is more useful in 
practice, since it involves rotation operators depending on the projections of the 
angular momentum along the axes of a unique coordinate system. 



2.6. Rotation Matrices 



The matrix elements of the rotation operator D(a , (3, 7 ) between eigenstates of the 
total angular momentum are called rotation matrices. These quantities, which are 
explicitly defined by the relation 1 

V J mn (R) = < JM\D{R)\JN > (2.67) 

with the symbol R representing the triad (a, /3, 7 ), are of fundamental importance 
in angular momentum theory. Their physical meaning descends directly from that 
of the rotation operator D{R ) defined in the previous section. Given a physical 
system of angular momentum J, the square modulus 

\V j M n(R)\ 2 



represents the probability of finding the value N for the 2 / -axis projection of the 
momentum after its 2 -axis projection has been measured and found equal to M, 
the system ( x'y'z ') being obtained from (xyz) through the rotation R. Note that 



where J w is the operator obtained from J y by a rotation through an angle a about the z-axis, 
J w = D z {a) J y D~\a) = D z {a) J y D z {-a) . 

Taking this expression into account we obtain, by a power series expansion of the exponential 
D w (/?) = D z (a) e~ il3J y D z (-a) = D z (a) D y (/?) D z (-a) . 

The same argument can be applied to deduce Eq. (2.65). 

Note that the matrix elements < JM\D(R)\ J'M'y, with J' / J, identically vanish. This is 
a consequence of the fact that the operator D(R) commutes with J 2 . 
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if | JM) is the eigenvector corresponding to the projection M of the angular mo- 
mentum along the z-axis of the old reference system, the vector D(R)\JMy is 
the eigenvector corresponding to the same projection along the axis z' of the new 
reference system. 

To find an explicit expression for the rotation matrices, we first define the reduced 
rotation matrices d J MN . From Eq. (2.66) we have 



V J MN {a^) = < JM\e 



~ iaJ > e~ i0J « e~ hJ *\JN) = e“ i(aM+7JV) d 



mn(0) 



( 2 . 68 ) 



where 

d J MN {0) = (JM\e~ i0Jy \JNy . 

The reduced rotation matrices can be easily calculated in the case J = 1/2. In- 
troducing the spinors | + ) and | — ) to represent the eigenstates | JM) having 
J = 1/2 and M equal to +1/2 and —1/2, respectively, we have from Eqs. (2.9) 
and (2.10), taking into account that J y = — i (J + — J_)/2 



J y \ + > - g I “ ) 

so that, with easy transformations 
— i/3J 



Jy\ — ) — I + ) ’ 



+ > = 






P 



P 



n = 0 



7.! 



^| + >=cos^| + >+sin^|-> 



and similarly 

e »|->=cos-|->-sm-| + >. 

Thus the reduced rotation matrices for J = 1/2 are the following 

d\i{0) = d?_ 1 _i{0) = cos ^ 

2 2 2 2 Z 

d 0ii(0) = ~d\A0) = sin ^ . 

2 2 2 2 Z 

Turning to the general case, let us consider an angular momentum eigenstate of 
the form |JJ). This eigenstate can be expressed in terms of spinors by regarding 
the angular momentum J as the result of the addition of 2 J momenta J 1 , J 2 , , • ■ , J 2 j 
each equal to 1/2. In this representation we have, with obvious notations 



\Jjy — I + )ll + >2 " ‘ I + >27 > 

where | + ) j is the normalized eigenvector of the operator J iz corresponding to the 
eigenvalue +1/2. 

In the same representation, the eigenvector | JAf) can be obtained by repeated 
application of the operator 



J_ — J, _ + J 2 _ + • • • + J 2 j_ • 




ANGULAR MOMENTUM AND RACAH ALGEBRA 



55 



|J J-1> = ^=J_|JJ>= 1 



V2J" 

— — ( i 

V2J 1 



V2 JL 



Starting from the eigenvector | JJ) we have from Eq. (2.11) 

"A- + ^2- ■+ h ^2 J- I + )ll + )2 " • I + ) 2 J 

= {l _ )ll + >2 " ' I + >27 + l + )ll~>2'"l + >27 + 

+ I + >ll + >2 " ' I - >2j} i 



and expressing in succession the eigenvectors |J J — 2), |J J— 3), ... we finally 
obtain 



| JM> = 



1 (J + M)\ (J - M)\ ^r 



(2J)! 



E( j->i->---i->J + >i + >---i + > } - 



( J— M)terms 



( J+M)terms 



with the sum extended to all the permutations having (J — M) spinors of the form 
| — ) and (J + M) spinors of the form | + ). 

Thus we can write 



d J MN {0)=(JM |e i0J y\ JN) = 



X 



E{<_K 



VV + M)! (J - Af)! (J + iV)! (J - JV)! 

(2J)! 

< ~ I < + I < + I ■ ■ • < + |j e~ W Jl » + Ja » + '" +JaJ » ) 



P 



(J— M)terms (J+M)terms 

x Hi j - >i - > • • • i - >j + >i + > • • • i + > } • 



( J— iV)terms 



( J+AT)terms 



Let us consider a single term in the sum over q\ the operator acting on it, 

will produce, apart from the ordering of the single vectors | + ) and | — ) , a state 
vector of the form 



/5 | x . 0 | , \ 

cos - | - > - sm - | + > 



J-N 



0 I , \ , • 0 I V 

cos - I + > + sm - I - > 



J+N 



J-N J+N 

= E E 

£=0 r=0 



J-N' 
t 



J + N 



(-1)* 



cos ■ 



P 



J-N-t+J+N-r 



sm ■ 



0 



t+r 



x|->|->”-|->l + >l + >-"l + > • 

S. > N y > 

( J— N— t+r)terms ( J+N— r+t)terms 

Whatever the ordering of the various indices, this state vector gives a non-zero 
scalar product with one and only one of the terms in the sum over p, provided that 



J-M = J - N -t + r 
J + M = J + N -r + t . 
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These equalities are both satisfied if r = N — M + t, and since the sum over q gives 
(2 J)! [( J + N)\ (J — N)\ ] _1 terms of the same kind 

d J MN (/?) = V(J + M)\ ( J - M)\ ( J + IV)! ( J - N)\ 

/ \2J+M-N-2t / \ 2 t-M+N 

( cos f ) ( sin f ) 

X Vf-lV - - 1 - ('2 69') 

^ ’ (J + M-t)\(J-N-t)\t\(t + N-M)\ ’ ; 

where the sum runs over all the t values which lead to non-negative factorials. 

The reduced rotation matrices have several important properties that can be 
easily deduced either from their definition or from their analytical expression. 

Reality: 

d'MN(P)* = 

Closure: _ 

Y. ^mjv(^i) diip(P 2) = d J MP {(3 1 + (3 2 ) 

N 

Symmetry: 

= ( _ 1) M N d J _ M _ N (f3) 

= (-l ) M ~ N d J N M 
= (-l ) M ~ N d J MN {-P) 

= (-l) J+M d^_ jv(7T-/?) 

= (— 1) J+JV d J M _ N {'K + (3) . (2.70) 

Explicit analytical formulae for J = 1 and 2 are given in Table 2.1. Additional 
formulae for J = 3 to 6 can be found in Buckmaster (1964, 1966). An interesting 
connection between reduced rotation matrices and Jacobi polynomials can be found 
in Edmonds (1957). 

Going back to the rotation matrices defined in Eq. (2.67), it is possible to establish 
for them a number of important properties. First of all, as the rotation operator 
D(R) is a unitary operator (D^(R) = D~ 1 (R)), we have 

V J MN {<»fo )* = V J NM {- 7 -13 -a) , (2.71) 

where (—7 —(3— a) are the Euler angles characterizing the inverse rotation of (a/3y). 

From the unitary character of the operator D{R) one gets the orthogonality 
relations 



Y, 'Dpn(R)* ’D'pm(R) ~ $MN 

p 

Y, V J mp(R) R >J np{R)* = d MN 



(2.72) 
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TABLE 2.1 

Algebraic formulae for d\ 1N {0) (top) and d? MN {0) (bottom). C and S stand for cos/3 and sin/3, 
respectively. 





N = -1 


o 

II 

fe; 


N = +1 


M = -1 


±(i+C) 


n/2 


O 


o 

II 





c 


J_C 

V2 b 


M = +1 


5(1 -C) 


_ 

^2* 


i(!+C) 





N = -2 


JV = -1 


o 

II 

fe; 


AT = +1 


N = +2 




l(l+C) 2 


|S(1 + C) 


x/P 2 


|S(1-C) 


i(i-c) 2 




-±S(1 + C) 


(C- f)(C + l) 


x/P C 


(C+f)(l-C) 


iS(l-C) 


o 

II 


x/P 2 


~x/P c 


i(3C 2 -l) 


x/P c 


x/P 2 


M = +1 


-iS(l-C) 


(C+|)(l-C) 


-y/lsc 


(C-±)(C + 1) 


iS(l + C) 


M = +2 


i(i-c) 2 


-iS(i-C) 


x/P 2 


-±S(1 + C) 


ia + c ) 2 



From Eq. (2.68), using the symmetry properties (2.70), it can be easily shown 
that 

V J mnWiT = V J _ M _ N (afa) . (2.73) 

If R is a rotation composed of two consecutive rotations R Y and R 2 , the operator 
D(R) can be written as the product D(R 1 ) D(R 2 ). 1 For the rotation matrices we 
thus have, directly from their definition 

V j mn (R) = (JM\D(R 1 )D(R 2 )\JNy = £ V J MP {R X ) V J PN {R 2 ) , (2.74) 

P 

This statement is not trivial. Indeed the operator D(R) should be written 

D{R) = D'(R 2 ) D(R 1 ) , 

where 

D’{R 2 ) = e - i “2 J z' e -i ^ 2 ‘V e~ i ' y 2 J z l , 

with J y , , J z , the components of J on the axes y', z ' obtained from y , z through the rotation R 1 . 
Since 

J y , = D(R 1 ) J y D-^RJ , J z , = D^RJ J z D~ 1 (R 1 ) , 
we have, by series expansion of the exponentials 

D'(R 2 )= [D(R 1 )e~ ia 2 J z D- 1 (33 1 )] [t ) ( 3? 1 ) e _i ^ 2 J w D _1 (i? 1 )] [d/RJ D" 1 ^)] , 

so that 



D(R) = D{R 1 )D{R 2 ) . 
































58 



CHAPTER 2 



the so-called closure property of the rotation matrices. 

Another important property follows from the addition rules of two angular mo- 
menta. Let us consider the product 

V MN( a 0l) V M'N'( a 0l) ; 

according to the definition of rotation matrices, this product can be written in the 
form 



<JM |e 



— i otJ — i f3J —by J 



\JN}(j'M'\e 



— i aJ — i /3J —i'y J 



j'N'y 



If we now consider the angular momentum operators appearing in the two matrix 
elements as referring to two different systems, we can write 

= (JJ’MM’ |e _iaJ ’ e _i7 ^| JJ'NN') , 



where the J-operator is the resultant of the angular momenta of the two systems. 
On the other hand, bearing in mind the coupling rules of two angular momenta 
and the definition of 3 -j symbols (Eqs. (2.12) and (2.22)), one obtains with easy 
transformations 1 



^MN( a 0'Y)’E > M'N , ( a ft'y) — 



= £(2tf + l) 

K 



( J 


j’ 


K \ 


( J 


J' 


K \ 


1m 


M’ 


Q 


u 


N' 


Q' 



VqQ'WiT , 



(2.75) 



a formula that shows how the product of two rotation matrices of rank J and J' 
can be reduced to the linear combination of rotation matrices having rank K such 
that \ J -J'\ < K < (. J+J '). 

From Eq. (2.75), using the orthogonality relation (2.23a) of the 3 -j symbols, the 
following expressions are also obtained 



2>qq'(«07)* = 

= E (2 k + i)( j m m> q){ J n N' o^ v i N ( a ^) v M' N '( a ^ ; 

MM' NN' ' ^ / \ ^ / 

E ^MJv( a /^7) 'D J M'N'{ a Pl) T ) M"N"{ a Pl) f 

AIM' M" ' 

( J J ' J"\ 

\N N’ N" ) ' 




Note that a formal summation over Q and Q' might be added in the right-hand side of 
Eq. (2.75). 




ANGULAR MOMENTUM AND RACAH ALGEBRA 



59 



Another important property of the rotation matrices is the so-called Weyl theo- 
rem 



7T Z7T 7T 

da j dq J d/3 sin (3T> J MN (af3'y)v{ I , N ,(al3'y)* = 



0 0 0 
8 tt 2 



2J + 1 



8 T T $ A/f A/f ' S 



J J' U MM' U NN’ ! 



(2.76) 



which is indeed valid only when J and J 1 are both integers or both half-integers. 
To prove the theorem, we note that Eqs. (2.73) and (2.75) give 



E( 2 ^+ 1 ) 

K 



( J 


J ' 




[ J 


J' 


K \ 


l M 


-M' 


Q/ 


U 


—TV' 


Q’ 



, (2-77) 



where, if we confine ourselves to one of the two cases specified above, the sum over 
K runs only over integral values. The integral in Eq. (2.76) is therefore reduced to 
the sum of various integrals of the form 



2tt 2ir 7 r 

J da J dy J d/3 sin/3 (a/3y)* , 
0 0 0 



with K , Q, and Q ! integers. The integrals over a and 7 can be easily performed 
(see Eq. (2.68)) and give the value 4tt 2 Sq 0 Sq, 0 . On the other hand, Eq. (2.69) 
yields 



(cos f ) (sin f) 



4(/3 ) = ( j A!) 2 E(- 1 )‘ 



2 * 



and since 

tt/2 

/ (~) 2 “ +, (cos*) 2 ‘ +1 d* = 2(a ° ! “ +1)! . 

0 

we have 

7T 

y<o(/3)sin/3d/3= =2 ^° • 

0 4 

Thus the integral in Eq. (2.76) reduces to 



87T 2 (- 1 ) 



M'-N' 



( J 


J' 




( J 


J' 


o\ 


\ M 


—M' 


0 J 


{ N 


-N' 


oj 
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or, using Eq. (2.26a), to 

871-2 * * r 
2 j _|_ i °JJ' °MM' °NN' ) 

which proves the Weyl theorem. 

This theorem can be generalized to the case where one of the angular momenta 
J and J' appearing in the rotation matrices of Eq. (2.76) is a half-integer while 
the other is an integer. In this case, however, the integrals over a and 7 must be 
extended to the interval (0, 47 t) instead of (0, 27 t), and the result is zero. Therefore 
the general form of the Weyl theorem, valid for any value of J and J' , is the 
following 



477 477 



^da Jo I7 J d/3 sm/3Vi [N (ap'y)v{ I , N ,(af3'y)* = 



32tt 2 
2J + 1 



djj' W ^NN' 



From the Weyl theorem in the form of Eq. (2.76) it is possible to find a highly 
symmetrical expression for the integral of the product of three rotation matrices 



2 7T 



2-7T 



f cla j dy J d/3 sin/3 D J ^ N ^(a/3'y) 'D J ^ N J y a(3')) 'D J ^ N J y a(3')) = 



= 8tt 2 



J, 



^2 


^3 


) ( J 1 


^2 


J3 | 


m 2 


-^3 , 


J [n. 


n 2 


n 3 J 



It should be remarked, however, that this expression is valid only when the three 
J’s are integers or when two of them are half-integers and the third is an integer. 
In different cases the expression in the right-hand side is meaningless. 



2.7. Irreducible Spherical Tensors 



In the ordinary three-dimensional space referred to a right-handed coordinate sys- 
tem, a Cartesian tensor of rank k is defined as a set of 3 fc quantities (called the 
components of the tensor) which, under rotation of the coordinate system, change 
according to the linear relationship 



T (*i, * 2 , • • ■ , 7 fc) 



«(*i. Ji) a (*2> J2) • • • a (Wfc) t (Ji> 32 . • • ■ > Jfc) 

3i 3 2 ‘“3k 

(*1, *2> • • • *fe) Jl) 72) • • • i3k = 1’ 2,3) , 



where T(J 1 ,j 2 , . . . ,j k ) and . . . ,i k ) are the tensor components in the old 

and new reference system, respectively, and where the coefficients a(q,j)) are the 
direction cosines of the new axes relative to the old axes. 
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For instance, the Cartesian components of a usual vector A satisfy a transforma- 
tion of the form 

A i = Y^ a ( i J) A j . 

3 

thus a vector is a Cartesian tensor of rank 1. If A and B are two arbitrary vectors, 
defining 

C ij = A i R j 

we have 

C[ j =Y j a{i,k)a{j,l)C kl , 

kl 

which shows that the dyadic product of two vectors forms a Cartesian tensor of 
rank 2. 

An irreducible spherical tensor of rank k is defined as a set of (2k + 1) quanti- 
ties (called the components of the spherical tensor) which, under rotation of the 
coordinate system, change according to the linear relation 

T ' k q =Y. T v V m( R ) (q,P=~k,-k+l,...,k) , (2.78) 

P 



where T'j and 7))' are the tensor components in the new and old reference system, 
respectively, and where R is the rotation that brings the old system into the new 
one. 

Contrary to the case of Cartesian tensors, the dyadic product of two irreducible 
spherical tensors no longer is an irreducible spherical tensor. In fact, if R* and S 
are two irreducible tensors, the (2k + 1)(2 k! + 1) components of the form R^S^, 
change, under the rotation R , into 



(k 


k! 


K '\ 


(k 


k' 


K '\ 


u 


V' 


Q'J 


u 


q' 


P'J 



(R k q s k q :y = E ( R p S p') V p q ( R ) R p-A R ) ■ 

pp' 

On the other hand, using Eq. (2.75) we can write 

( Rk q s k q :y = E ( r p s p')Y1 c 2k ' + x ) 

pp' K' 

Multiplying both sides by 

^_^k—k’+Q 

summing over q and q ' , and defining 

t q =Y(-v k ~ k ' +Q 



-) k ' 

J Q'F 



k k' I\ 

q q' -Q 



k k! K 

q q ' -Q 



(■ Rk A ') » 



(2.79) 
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we obtain, with the help of Eq. (2.23a) 

T, Q=E(^ , )(- 1 ) fc " fc ' +0 (p P> q) V Q'X R T 

= E T -Q' (-1) Q+Q ' Vq'-qW ■ 

Q' 

Recalling the conjugation property of the rotation matrices (Eq. (2.73)) and chang- 
ing the dummy index Q' in — P, we obtain for Tq the typical transformation law 
for spherical tensors, 

T'^ = Y, T p V pQ^ R ) ■ 

p 

Equation (2.79) can thus be regarded as a kind of recipe to obtain a spherical 
tensor of rank K from the dyadic product of two spherical tensors of rank k and 
k' . It should be noted, however, that definition (2.79) is by no means unique, as 
the multiplication of the right-hand side by a factor of the form f(K), with f an 
arbitrary function, does not affect the transformation law of the tensor Tq under 
rotations. 

The concept of spherical tensor can be easily extended to quantum-mechanical 
operators. A spherical tensor operator of rank k is defined as a set of (2 k + 1) 
operators (called the components of the tensor) which, under rotation of the coor- 
dinate system, change according to the linear relation (2.78). Since, on the other 
hand, quantum-mechanical operators are transformed according to Eq. (2.57), the 
components of a spherical tensor operator must satisfy the relation 

D{R) T k q Pt (p) = Y, T k V k pq (P) . (2.80) 

p 

From this equation it is possible to deduce the commutation rules of a spherical 
tensor operator with angular momentum. Let us consider an infinitesimal rotation 
of the coordinate system through an angle <f> about the unit vector it. We have (see 
Eq. (2.61)) 

D(R) = 1 - i <t>J u 
D\R) = l+i^J u 

V pq( R ) = <M! - i <t>J u \kq) = s pq - \4><kp\J u \kq) . 

Substitution into Eq. (2.80) yields 

[JuX] = E T P < k P\ J u\kq) , 

V 

and equating J u with J x , J y , and J z one gets (see Eqs. (2.7), (2.9), and (2.10)) 

[j z ,X = X 



[J ± , T k } = V(k±q+l)(k T q) T k ±1 



(2.81) 
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Conversely, these commutation rules could be taken as basic definitions of a spher- 
ical tensor operator. It can indeed be proved that from these equations one easily 
gets back transformation (2.78). 

From Eqs. (2.81) it is possible to find the spherical components, A l q , of a vector 
operator A. Equation (2.60) gives 



[J z , A z ] — 0 , 

hence we can write 

^in- 
setting k = 1 and q = 0 in the second of Eqs. (2.81) we further obtain 

V&M = V / 27l ±1 , 



so that, summarizing 



A\ = ~{A x +iA y ). (2.82) 



Given two tensor operators of rank 1, it is possible to construct three tensor 
operators of rank 0, 1, and 2, respectively, by considering the linear combinations 
(2.79). For the 0-rank tensor one gets, using Eq. (2.26a) 



rr lO 
1 0 









(2.83) 



which shows that, apart from a numerical factor (—1/^/3), the 0-rank tensor is 
simply the scalar product of the two vectors. 

For the tensor of rank 1, a direct application of Eq. (2.79), with the 3 -j symbols 
computed from Eqs. (2.26c,d), gives 

T\ = (Ro S -i R USD 

Tq = -j= (Rlsi, - RUSl) 

Tl = (RlS'o - RlSl ) , 
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or, in terms of Cartesian components (see Eqs. (2.82) and their inverse) 



T, = — ( R y S z - R z S y ) 

T y = ^ ( R Z S X - R X S Z ) 

T z = ^ ( R x S y - R y S x ) . 

Therefore the spherical components of the tensor of rank 1 determine a vector 
which, apart from a numerical factor, is simply the cross product of the vectors 
and S, 

T = -^= Rx S . 

Finally, the components of the tensor of rank 2 are given by 

7~>2 ^ pi cl 

~ 2 ~VE ~ v - 1 



rn2 

1 -1 _ 



1 



VTo 



{Ro^-i + Rhs]) 



rp2 _ J 

0 “ V30 

9 1 

rpZ 

1 ~7io 

rp 2 ^ pi ol 

2 ~V5 1 '■ 



{R 1 _ 1 S{ + 2R 1 0 Sl+RiS 1 _ 1 ) 



{RlS\ + R\Sl) 



(2.84) 



It should be remarked that the adjoint (or Hermitian conjugate) of a spherical 
tensor operator T q no longer is a spherical tensor operator. This can be easily 
proved by taking the Hermitian conjugates of Eqs. (2.81), recalling that 

[A,B]t = -[At,Bt] j (J ± )t = J=F ; 



we have 



[J z ,(T fc )t] = -g(T fc )t 



[■ J± , (2?) f ] - -V(*Tg + i)(fe±g) (^ Tl ) t . 

However, the operator 

where r is zero for /c integer and 1/2 for k half-integer, is indeed a spherical tensor 
operator. In fact 

[J z ,O k q ] = -(-1 r 9 = (-iy-*q{TH q )1 = qO k 

[J ± ,o k q \ = — (— 1 r« [J T) r* 9 ]t = (-!)'• "=V(A'Tr/- !)(/.- : . V ) (r* wl )t 



= \/(fc±g + i)(fcTg) 0„±i • 
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The spherical tensor operator can be considered as the Hermitian conjugate of 
the tensor employing for this tensor the notation T t we have 

(Tt) g fc = (-l)^(T^)t. (2.85) 

A Hermitian spherical tensor operator is defined as an operator such that 



(T f ) 9 fc = T ( 



as a consequence, the components of such operators satisfy the relation 

T* = (-l) r -« (r*,)t . (2.86) 



It is seen at once that the spherical components A l q of a Hermitian vector A form 
a Hermitian spherical tensor operator. From Eqs. (2.82) we have 

{AU = ±={A x +\A y ) = -A\ 

(A^ = A z = Al 

(A\)' = --L(A x -iA y ) = -Al i, 
so that Eq. (2.86) is satisfied. 



2.8. The Wigner-Eckart Theorem and its Consequences 

The Wigner-Eckart theorem has a fundamental importance in angular momentum 
theory, as it allows the matrix elements of spherical tensor operators to be reduced 
to simple mathematical expressions. 

For the evaluation of the matrix element 

(aJM\Tq\a' J' M ,s ) , (2.87) 

where T* is the (/-component of a spherical tensor operator of rank k, and where 
a and a' are supplementary quantum numbers (relative to Hermitian operators 
commuting with J 2 and J z ), we first consider the (2k + 1)(2 J' + 1) vectors 

| a' J'M'y , 

or, more precisely, their linear combinations 

T* | a'J'M'y (J'kM'q\J"M"} , (2.88) 

M'q 
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where J" is a given angular momentum eigenvalue having the same character (in- 
teger or half-integer) of (J 7 + k), and limited between \J' — k\ and (J' + k). We 
will now show that the (2 J" + 1) vectors obtained by varying M" in the linear 
combinations (2.88) are eigenvectors of J 2 and J z corresponding to the eigenvalues 
J"(J" + 1) and M", respectively. 

Denoting by |u(J",M")) the linear combination (2.88), we have 

J z \v(J",M")} = E J z T q \a'J'M'}(J'kM'q\J"M"y , 

M'q 

and using the commutation rules of spherical tensors (Eqs. (2.81)) we obtain 
J z \v(J", M")> = J2 ( M> + 9) T q I a'J'M'y < J'kM'q\ J"M") 

M'q 

= M" \v(J", M")} . (2.89) 



Similarly, one gets from Eqs. (2.81) and (2.9) 

J + \v(J",M")) = 

= J2 y/J'W + !) - M'(M' + 1) | a'J' M' + iy (J , kM , q\J"M"') 

M'q 

+ 22 VHk + 1) - q(q + 1) Tf +1 \a'J'M')(J'kM'q\J"M"y , 

M'q 

and renaming the summation indices 



J + |u(J",M")> = 



= E T q WJ’M’y 

M'q 



+ 1) - M'(M' - 1) < J’k M' — l q\J"M"y 



+ y/k(k + 1) - q(q - 1) < J’kM ' q-l\J"M"y 



which can be rewritten, with the help of the recursion relation (2.21), in the form 



J + \v(J",M")y = yJJ"{J» + 1) - M"(M" + 1) \v(J", M" + i)y . 

In a similar way one can prove that 

M")y = y/j"(j» + 1) - M"(M" - 1) \v(J", M" — i)y , 



so that, from Eqs. (2.89), (2.90), and (2.91) 



J 2 \v(J", M")> = 



Jz + o(J+J- + J-J+) 



\v{j",M")y 



= j”(j" + i)\v(j",M")y . 



(2.90) 



(2.91) 



(2.92) 
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Equations (2.89) and (2.92) show that the (2 J" + 1) vectors of the form 
\v(J",M")) are angular momentum eigenvectors (in general not normalized) cor- 
responding to the eigenvalues J" and M" . 

Recalling the definition of |«(J",M")) we have, after inversion of Eq. (2.88) via 
the orthogonality relations of the vector-coupling coefficients (Eq. (2.20b)) 

T k | a'J'M'} = (J'kM'q\J"M"y |u(J",M")> , 

J" 

and hence the scalar products (2.87) can be written in the form 

(aJM\Tq\a'J'M'y = ^ (J'kM'q\J" M"y <, aJM\v(J ", Af")> . (2.93) 

j" 

On the other hand, the scalar product (aJM\v{ J” , M")y is zero unless J = J" 
and M = M" . Moreover, it is independent of M (see the comments after Eq. (2.30)) 
so that we can write 

(aJM\v(J",M")y = 5jj„ S MM „ (— l) 2fc (a,J\\T k \\a' J’y , (2.94) 

where the quantity (aJ\\T k \\a' J'y is the so-called reduced matrix element 1 of the 
operator T k . Substitution of Eq. (2.94) into Eq. (2.93) leads to the final expression 
of the Wigner-Eckart theorem 

(atJM\Tq\a'J'M'y = (-l) 2k (aJ\\T k \\a' J'y (J'hM'q\JMy , (2.95) 

or, in terms of 3 -j symbols 

(aJM\T k \a'J'M'y = 

= (-1 ) J,+k+M V2J + 1 (_ J M J M , k ^j(aJ\\T k \\ a 'J'y . (2.96) 

The reduced matrix element can be expressed in terms of ordinary matrix ele- 
ments by inversion of Eq. (2.96). From the orthogonality relation of the vector- 
coupling coefficients (Eq. (2.20a)) we have 

<aJ||T fc ||a , J , > = (-l) 2fc ^ (J'kM'q\JMy(aJM\T k \a'J'M'y . 

M'q 

A different way for calculating the reduced matrix element is to substitute some 
special values of M and M' into Eq. (2.95). For the reduced matrix element of the 
angular momentum operator J we have for instance (denoting J 1 by J ) 

(aJM\j£\a f J' M f > = M 6 aa , Sjj, S MM , = (aJ\\ J ||a' J'> <J'1M'0| JM) , 

The definition given here for the reduced matrix element agrees with that given by Brink 
and Satchler (1968), while it differs by a factor y/2 J + 1 from those given by Racah (1942) and 
Edmonds (1957). 
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and being 

M 

(J1M0IJM) = ; 

VJ(J + 1 ) 

one obtains 

(aJ\\J\\a'J'y = 6 aa , 8 jj, y/j(J+l) . (2.97) 



An important property of the reduced matrix element can be easily deduced from 
the orthogonality relation of the vector-coupling coefficients 



E \(ocJM\T*\a'J’M'y \ 2 = |<aJ||T fc ||a , J / >| 2 . 

M'q 



The most striking consequence of the Wigner-Eckart theorem is the direct ap- 
pearance of selection rules for the matrix elements of the components of spherical 
tensor operators. Indeed, because of the presence of the 3 -j symbol in Eq. (2.96), 
the matrix element <( a J M \ T k \a' .V M ' ) is identically zero unless 

\J - k\ < J' < J + k , M' = M-q. 



In particular, we obtain from Eq. (2.26a) that the matrix elements of a zero-rank 
spherical tensor operator are diagonal with respect to J and M, and are indepen- 
dent of M. 



2.9. Properties of Reduced Matrix Elements 



From the Wigner-Eckart theorem an important relation can be deduced for the 
reduced matrix elements of a Hermitian spherical tensor operator . Taking the 
complex conjugate of Eq. (2.95) we have 

(aJM\H*\a'J'M'y = (-l) 2fc (aJ\\H k \\a' J'}* {J’kM'q\ JM) . (2.98) 

On the other hand, being H k Hermitian, from Eq. (2.86) we deduce 

(aJM\H k \a'J'M'y = (a' J' M'\(H k ^\aJM) 

= (-l) r+9 (a'J'M'\H k q \aJMy . 

Applying again the Wigner-Eckart theorem to the last matrix element we get 

(aJM\H k \a'J'M'y = (-l) r+g+2fc (a'J'\\H k \\a J > {JkM - q\J'M '} . (2.99) 

From the symmetry properties of the vector-coupling coefficients (Eqs. (2.22), 
(2.24), (2.25)) we have 



, , / O J' 1 

{JkM - q\J'M ') = ~ M+M J < J'kM'q\JM > , (2.100) 
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and, finally, from Eqs. (2.98), (2.99), and (2.100) we get 

V2J + 1 (aJ\\H k \\a' J'>* = (-1 )' J ~ J ' +r V2 J' + 1 (a' J'\\H k \\aJ} , (2.101) 



which is the relation, valid for Hermitian spherical tensor operators only, between 
reduced matrix elements of the form (aJ\\H k \\a' J'y and (a! J'\\H k \\aJ). 

In Sect. 2.7 we have shown how it is possible to construct an irreducible spherical 
tensor Tq from the product of two irreducible tensors R k and S k , . We want here 
to show how the reduced matrix elements of these tensors are related among each 
other. The Wigner-Eckart theorem, applied to the tensor Tq , gives 



(cxJM\Tq \a' J'M'y 



(-1)J' +k+m V2JTT^ J m J m , (aJ\\T K \\a'J'y . 



From Eq. (2.79), introducing intermediate states between the tensors R k and S k , 
and applying again the Wigner-Eckart theorem, one obtains 



Y Y (~i) j " +M " +2k ~ K+Q j" + 1 

qq' a." J" M" 



X 



( J J" k\( J" 

\ —M M" q J ^ -M" 



J' k'\ (k k’ K \ 
M’ q'){q q> -Q J 



= (-« m ' K Q )<°A\T«\wr>. 



Multiplication of both sides by 



f J J • K\ 

^ — M M’ Q J 

followed by summation over M and M' gives, using Eqs. (2.23a) and (2.34) 1 

{aJ\\T K \\a'J'y =(-l y+J'+K Y. V / 2J 77 +1 { 

a" J" ^ 

x (aJ\\R k \\a" J"y (a" J"\\S k ' \\a' J'y , ( 2 . 102 ) 

which shows how the reduced matrix element of a ‘composite’ spherical tensor is 
related to the reduced matrix elements of the constituent tensors. 



The deduction of Eq. (2.102) is not trivial, as it involves the contraction of four 3 -j symbols 
which must be suitably manipulated to obtain the formula (Eq. (2.34)) leading to the final result. 
In App. 2 we give, for the unexperienced reader, an example of how the various manipulations 
can be performed. 
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A direct application of this formula to the 0-rank tensor T° obtained from the 
scalar product of an arbitrary vector v with the angular momentum J gives (see 
Eq. (2.83)) 

(aJ\\T°\\a'J') = — \= < aJ\\v ■ J\\a'J '> 
v3 

= (-1) J+J ' V2 J" + 1 

a" J" 

which, using Eqs. (2.36a) and (2.97), can be written in the form 

(aJ\\v ■ JHa'J') = Sjj, \J J{J + 1) (a J\\v \\a' J) . 

From this equation a remarkable relation, often referred to as the projection theo- 
rem , can be easily proved by a double application of the Wigner-Eckart theorem, 

[J(J+ 1)] (aJM\v\a'JM'y = (aJM \ (v ■ J)J\a'JM ') . (2.103) 

We have indeed for the spherical components of the operator J 

<aJM| (v ■ J) Jg \a' JM'y = 

= <OiJM\v ■ J\a" J" M"y (a"J"M"\ j\ \ a'JM'} 

a" J" M" 

= (c*J\\v- J\\a , J)(J0M"0\JM)(a , J\\J\\a , J}(JlM , q\JM"y 

M" 

= J{J + 1 ) (aJ\\v\\a'jy (JlM'q\JMy 

= J{J + 1) (aJM | v\ | a' JM'y , (2.104) 

which proves Eq. (2.103). 

The projection theorem has a very simple physical meaning that is depicted in 
Fig. 2.5. According to the vectorial model, the vector v rotates rapidly about J, 
so that its only effective component, i7 efI , points in the J-direction. On the other 
hand, iT eff can be written in the form 



J J' 0 
1 1 J" 



<aJ||F||a"J"><a ,, J ,, ||J||a , J / > , 



'i’eff = 



(v ■ J)J 

J2 ’ 



thus the projection theorem can be considered as the direct generalization to Quan- 
tum Mechanics of this simple geometrical expression. 

The projection theorem can be expressed in the equivalent form (see Eq. (2.104)) 



(aJM\v\a! JM'y = g aa ,j(v ) < a'JM\J\a'JM'y , 
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Fig. 2. 5. The vector v is rapidly spinning about J, so that its only effective component, is in 
the J -direction. 



where 



9oca'j(v) 



(aJM\v- J\a'JM) 

J(J+ 1) 



(aJ\\ v\\a'jy 

\/J(J + 1) 



(2.105) 



is the generalization of the Lande factor that will be encountered in the next chap- 
ter. 

A final application of the Wigner-Eckart theorem concerns the matrix elements of 
a spherical tensor operator acting on the dynamical variables of a single component 
of a composite system. 

Let us consider a system composed of two parts of angular momenta and J 2 , 
respectively, having total angular momentum J = J x + J 2 . If T k ( 1) is a spher- 
ical tensor operator (the argument (1) means that it is acting only on the first 
component of the system), we have for its matrix elements 



(ajJ 2 JM\T k (l) \o! j[j ' 2 J'M'y = 



= (-l) 2fc (J'kM'q\JMy ( L aj 1 j 2 J\\ T k { 1) ||«' > . (2.106) 

On the other hand, if we change the coupling scheme using the vector-coupling 
coefficients, we can write 



= 51 0O2 m l m 2l JM )0li2 TO l TO 2l J,M ') 

m 1 m 2 m , 1 m , 2 

X <ajij 2 TO i TO 2 l T q (!) \oc j[j' 2 m\rri 2 ') . 

Since T k ( 1) acts only on the first component of the system, applying again the 
Wigner-Eckart theorem we obtain 

< a iii 2 TO i TO 2 1 Tq (1) \a j' 1 j 2 m' 1 m' 2 } = 

= (- 1 ) 2fe 0^'TOi9b'i TO iX«Ji!! X(l) II a'j[) 6m 2 m' 2 ■ 



(2.107) 
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Comparison of Eqs. (2.106) and (2.107) gives the following relation between reduced 
matrix elements 

(J'kM'q\JM)(aj lJ2 J\\ T k ( 1) || = 



= (jij2 m i m 2\ JM }(j[j2m[rn 2 \J' M '}{j[km' 1 q\j 1 rrii'> 

m 1 m 2 m' 1 

x <aj 1 ||T fe (l)||o'j()^ 2i , . 

This relation can be expressed in a more compact form by a calculation similar to 
that in App. 2. This calculation, which is left as an exercise to the reader, leads to 
the following expression 

<aj 1 j 2 J||T fc (l)||a'j(j'J') = 

= (-1 y^+ J '+ k yJ{2J' + l){2j 1 + l) | j 

x <aj 1 \\T k (l)\\a'j' 1 '>6 jaj , . (2.108) 

Similarly, if T k ( 2) is a spherical tensor operator acting only on the second part of 
the composite system, we obtain 

(a JlJ2 J\\ T k ( 2) ||a'j^J'> = 



fi k\ 

J k J 



= (-1 yi+j*+ J + k v /(2J' + 1)(2j 2 + 1) 



3-2 32 k \ 

J' J k J 



x (aj 2 \\T k i2)\\a'f 2 }S 1 



(2.109) 



/c k 

More generally, if R q * (1) and S q * (2) are two spherical tensor operators acting on 
the first and second part of the composite system, respectively, and if Tq is the 
spherical tensor operator constructed according to Eq. (2.79), the reduced matrix 
elements are connected by the following relation 



<akj 2 J\\ T K || = 

= v /(2j 1 + 1)(2j 2 + 1)(2J' + 1) 



J J' K ) 

h j'i K l 

3 2 3'1 k 2 J 



x £ (ajill fl fe i( 1) II a"j[y (a"j 2 \\ S k *( 2) ||a'j'> . 



The proof of this relation, that follows from Eqs. (2.34) and (2.48), is left as an 
exercise to the reader. 
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Polarization can be introduced in spectral lines by several different mechanisms, 
that are connected either with the presence of an external field (like for instance 
a magnetic or an electric field) or with the existence of some kind of anisotropy in 
the excitation of the atomic system (optical pumping, impact polarization, etc.). 

While the detailed theory describing the interplay of the various mechanisms 
will be tackled in the following chapters, it is first necessary to state some specific 
spectroscopic notations, and to give a number of basic results concerning the inter- 
action of an atomic system with an external field, in order to establish an adequate 
physical background. In particular, we will review in this chapter the theory of 
the Zeeman and Paschen-Back effects, including the case of hyperfine structure, 
and we will give a description of atomic polarization in terms of the density (or 
statistical) operator and of statistical tensors. 



3.1. Zeeman Effect 

According to the theory of atomic structure (see for instance Condon and Shortley, 
1935; Condon and Odaba§i, 1980), the modifications produced in atomic spectra by 
an external, uniform magnetic field can be described by adding to the unperturbed 
Hamiltonian H 0 of the atomic system an additional term H B , called the magnetic 
Hamiltonian , given by 

ff B =-^(i + 25)T+-i(Bxf) 2 , (3.1) 

47rmc amc 

where e 0 is the absolute value of the electron charge, m is the electron mass, h and 
c have their usual meaning of Planck constant and speed of light, L and S are the 
(dimensionless) total orbital angular momentum and total spin of the electronic 
cloud, B is the magnetic field vector, and r is the position operator defined by 




l 



where f) is the position of the 1-th optical electron relative to the nucleus. For one- 
electron atoms, Eq. (3.1) can be derived by taking the lowest-order, non relativistic 
limit of Dirac’s equation describing the motion of a particle in an electromagnetic 
field (see for instance Sclriff, 1949). 

The second term in Eq. (3.1) is the so-called diamagnetic term. Its importance 
is very limited in practice, at least for the magnetic fields that are usually found 
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in laboratory or astrophysical plasmas. Indeed, comparing the two terms in the 
right-hand side of Eq. (3.1) and taking, as an order of magnitude, the radius of the 
first Bohr’s orbit for the expectation value of r, we obtain for their ratio 



second term 
first term 



1 h 3 
32ir 3 m 2 ceQ 



B = 1.06 x 10 ~ W B , 



(3.2) 



where B is expressed in G. If we exclude the extremely large fields that are found in 
magnetic white dwarfs (see e.g. Angel et al., 1981, or Schmidt, 1987, for a review 
on the subject) or are supposed to exist in neutron stars, the diamagnetic term can 
be safely neglected in Eq. (3.1), so that we can write 



H s = £^(L + 2S)-B = li 0 (L + 2S)-B , (3.3) 

where /i 0 is the so-called Bohr magneton (/i 0 = 9.27 x 10 -21 erg G -1 ). 

If the magnetic field is so weak as to keep the magnetic energy much smaller than 
the energy intervals relative to the unperturbed Hamiltonian H 0 , the effect of H B 
can be computed by perturbation theory (see Messiah, 1961, for an introduction to 
the argument). According to this theory, the shifts of an iV-fold degenerate energy 
level of H 0 are obtained by diagonalization of the matrix 

(# b )« = ^ ] \H b \ «<“>> = 

where (uj) 1 ' 1 ) (k = 1, . . . ,N) are the degenerate eigenvectors of H 0 corresponding 
to the eigenvalue E a . 

In our case, since the Hamiltonian H 0 is invariant under rotations, the total 
angular momentum J and its projection M along an arbitrary z-axis are good 
quantum numbers, so that the eigenvectors can be written in the form | aJM'y 
(where a is a collection of inner quantum numbers of H 0 ), with 



H 0 \aJM} = E aJ | aJM} 

J 2 \aJM) = J(J + 1) | aJM} 

J z \aJM > = M | aJM) . (3.4) 



When we add to H 0 the magnetic Hamiltonian H B , the corrections to the degen- 
erate energy E aJ are found by diagonalization of the matrix 

(aJM\H B \aJM') = /j, 0 (aJM\ (L + 2S) ■ B \aJM’} . 

The matrix elements can be evaluated for any direction of the magnetic field in 
the reference system ( xyz ) chosen to describe the atomic system. Writing B in the 
form B = Bb , where b is a unit vector in the magnetic field direction, and writing 
(L + 2S) in the form (J+5), where J is the total angular momentum of the atomic 
system, we have 

(aJM\H B \aJM’} = n 0 B (aJM\ (J + S) ■ b\aJM ') . 
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Introducing the spherical components of the three vectors, the scalar product can 
be written in the form (see Eq. (2.83)) 

(J + S)-b = Y,(-l) q (J q + S q )b_ q , 

g 



so that, applying the Wigner-Eckart theorem in the form of Eq. (2.96), and using 
Eq. (2.97), one obtains 

(aJM\H B \aJM'y = g 0 B ^ (-l) J+M+9+1 v / 2jTl 

Q 

x (yj(j+ 1) + (aJ\\S\\ajyy_ q . (3.5) 

Owing to the selection rules of the 3 -j symbol, these matrix elements are zero 
unless (M' — M ) = 0, or ±1. We have therefore a tridiagonal matrix that might 
in principle be diagonalized by standard methods. 

The diagonalization can however be avoided by aligning the 2 -axis of the reference 
system with the direction of the magnetic field. In this case, in fact, the only non- 
zero component of the unit vector b is b (i = 1, and we easily obtain, substituting 
the value of the 3 -j symbol given by Eq. (2.26d) 



/ J J !\ 
\-M M' q) 



(aJM\H B \aJM'} = n 0 B g M S MM , , 



where g, the so-called Lande factor, can be written in the form 1 



<oJjj5JjoJ> 

yfjUTT) 



(JyfO). 



(3.6) 



(3.7) 



Equation (3.6) shows that the magnetic Hamiltonian is now represented by a di- 
agonal matrix. This means that, to the first order of perturbation theory, the 
eigenstates of the total Hamiltonian (H 0 + H B ) are of the form | aJM), while the 
eigenvalues are given by 



E a j + Mo gBM (M — — J, — J + 1, . . . , J). 

Thus the Hamiltonian H B removes the degeneracy, and any level characterized by 
the quantum number J is split into (2 J + 1) equally spaced sublevels, the splitting 
being proportional to the Lande factor g and to the magnetic field. 

We want to remark, however, that the states \a JM) are eigenvectors of the total 
Hamiltonian only when the 2 -axis of the reference system points in the magnetic 



As apparent from the 3 -j symbol in Eq. (3.5), the matrix elements of H g identically vanish 
if J = 0. This simply means that the magnetic field does not affect, to the first order, the energy 
of the atomic levels characterized by J = 0. Consequently, one can formally put g = 0 for such 
levels. 
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field direction. This means that a state having a definite value for the projection 
of the total angular momentum along the direction of the magnetic field has a 
well-defined energy value. When the z-axis of the reference system is not in the 
direction of the magnetic field, a state of the form | aJM') is an eigenvector of H 0 , 
but not of (H 0 + H b ). 

The general expression for the Lande factor, given in Eq. (3.7), involves the 
evaluation of the reduced matrix element of S. This can be easily done when 
the levels are described by the Russ el- Saunders (or L-S ) coupling scheme (see 
e.g. Condon and Slrortley, 1935). In this scheme the quantum numbers L and 
S corresponding to the total orbital angular momentum and to the total spin, 
respectively, are good quantum numbers, so that an eigenvector of the Hamiltonian 
H 0 can be written in the form 

| /3LSJM) , 

where (3 is a collection of quantum numbers representing the electronic configura- 
tion. The reduced matrix element is then 

(f3LSJ\\S WfiLS-jy , 

and it can be evaluated in the easiest way with the help of the projection theorem. 
From Eq. (2.105) we have 



(0LSJ\\S\\13LSJ) = _ 

and writing S • J in the form 

S- J = i [,J 2 + S 2 -L 2 ] 

we obtain for the Lande factor 

n ( 1 J (J + 1) + S(S + 1) — L(L + 1) 

9ls = + 2 J(J + 1) ■ 

This formula can be rewritten in the form 



(3.8) 



9ls = 1 + 7(-A S', L) , 



where we have used the compact notation 



7 (A,B,C) 



A(A + 1) + B(B + 1) - C(C + 1) 
2A(A+1) 



In the j-j coupling scheme (see e.g. Condon and Slrortley, 1935) a simple expres- 
sion for the Lande factor can be obtained for levels originating from two optical 
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electrons of orbital angular momentum and l 2 , respectively. In this case the 
reduced matrix element to be evaluated is the following 

(ft (^2 s 2)j2> J \\ S II ft Gl s l)jli (^2 S 2)j2> i 

where s 1 = s 2 = 1/2. Writing S in terms of the spins of the single electrons 
(5 = + s 2 ) and applying Eqs. (2.108), (2.109), and (2.36d), one obtains after 

some calculations 

9jj = 1 + J 2 ) 7(in 1/2, ii) + 7( J, J2)ii) 7(j 2 ; !/2, 1%) • 

Another case where it is possible to find an analytical expression for the Lande 
factor is the case of the so-called J 1 -l coupling. In this coupling scheme a ‘parent’ 
level of orbital angular momentum L 1 and spin S 1 couples its total angular mo- 
mentum J-l with the orbital angular momentum l of a further electron, to give an 
angular momentum K which in its turn couples with the electron spin to give the 
total angular momentum J. Thus the reduced matrix element to be evaluated is 
the following 



(ft ((L 1 S 1 )J 1 ,l)K, s, J\\S\\P s, J) , 

where s = 1/2. After some tedious algebra, that is left as an exercise to the reader, 
one gets the formula 

9jil = 1 + 7 (J, 1/2, K) + 7 ( J, AT, 1/2) j(K, J v l) j(J 1 ,S 1 , L x ) . 

In more complicated coupling schemes, or in intermediate coupling , no simple 
analytical expression can be found for the Lande factor unless the eigenfunction 
of the level is known in full detail. If we know, for instance, how to express the 
eigenvector |aJM) of a given level in terms of the L-S coupling eigenvectors (see 
Condon and Slrortley, 1935, for further details), 

| aJM) = c(/3, L, S) \/3LSJM > , 

pLS 

it can be shown by some Racalr algebra that 

Sic = Y m L ,s)\ 2 g L s . 

(3LS 

where g LS are the Lande factors computed from Eq. (3.8). This formula is however 
of little use, because the coefficients are seldom known with a sufficient degree of 
accuracy. 

It should be remarked, however, that in many cases the Lande factors are known 
from experimental work carried out in spectroscopy laboratories. A systematic 
(but incomplete and non-updated) list of experimental (/-values can be found in 
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Moore (1949, 1952, 1958). For the Fel and Fell spectra, further g- values can be 
found in Reader and Sugar (1975) and in Corliss and Sugar (1982). 

For an atom in L-S coupling, the Lande factor is equal to 1 for a level having 
S = 0, while it is equal to 2 for a level having L = 0. When both L and S are 
non-zero, g is generally bounded between 1 and 2, although many exceptions are 
found to this simple rule, especially for low ./-values. In rare cases, the combination 
of the quantum numbers may give a negative or vanishing value for g. L-S levels 
having g = 0 are 4 D 1 ^ 2 , 5 F lt 6 G 3 / 2 , 7 H 2 , 8 I 5 / 2 > etc., while L-S levels having g < 0 
are 6 F 1 ^ 2 , 7 G 1 , 8 G 1 / 2 , 8 H 3 / 2 , etc. In other cases, large values of g can result; for 
instance, L-S levels having g > 2.5 are 4 P 3 / 2 ( 9 = 2.667), 5 P 3 (g = 2.5), 6 D 1 / 2 
(g = 3.333), 7 D 3 (g = 3), 8 D 3/2 (g = 2.8), 8 F 1/2 (g = 4), etc. 

Let’s now consider the Zeeman pattern to be expected in the transition between 
two atomic levels, both split by the presence of a magnetic field. If J and J' are the 
angular momentum quantum numbers of the lower and upper level, respectively, 
and if g and g' are the corresponding Lande factors, the spectral line originating 
from the transition between the two levels splits, owing to the magnetic field, into 
a collection of components whose frequencies are given by 

v m'm> = "o + i ~y~ W M ' - 9 M ) . (3-9) 

where v 0 is the frequency of the unperturbed line and where M and M' are the 
magnetic quantum numbers of the lower and upper sublevels, respectively. Formula 
(3.9) is often written in the form 

i = ^0 + "l C g' M ' ~ 9 m ) . 



where 

u _ 9 qB _ e p B 
L h Annie 

is the so-called Larmor frequency, numerically we have 



= 1.3996 x 10 6 R , 



(3.10) 



(3.11) 



with B expressed in G and in s _1 . 

The wavelengths of the transitions can be easily evaluated in the visible and 
infrared, since in these cases the Larmor frequency is much smaller than u 0 (for 
instance, for a line at 1.2 jim and for B = 10 4 G, the ratio Pl/pq °f order 6 x 10 -5 ), 
so that we can write 



^ mm 1 — 2\A b (g M gM) , 



(3.12) 



where A 0 = cjv 0 is the wavelength of the unperturbed line, and where 



A\ _ \ 2 e o B 

XB A ° C 47TTOC 2 ' 



(3.13) 
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Numerically we have 

A\ b =4.6686 x 10~ w XlB , (3.14) 

where A\ B is in mA, A 0 in A, and B in G. 

It is important to note that, of the various Zeeman components (whose wave- 
lengths are given by Eq. (3.12)) arising in a given transition, only a limited number 
is actually observed: this is due to selection rules that depend on the interaction 
between atoms and radiation field. The general properties of Zeeman patterns 
(namely the number of observed lines, their wavelength position, their relative 
strength and polarization features) depend on the kind of interaction (electric- 
dipole, magnetic-dipole, electric-quadrupole, etc.). In the rest of this section we 
summarize the main properties of the Zeeman patterns resulting from the electric- 
dipole interaction, which is the simplest one and the most important in practice. 
The reader is referred to Sect. 9.1 for a formal derivation of what is anticipated 
here. 

The selection rule for electric-dipole transitions is 

AM = M' - M = 0, ±1 , (3.15) 

from which three distinct groups of Zeeman components arise. 

The components having AM = — 1, that will be called in the following the a r 
components, are generally displaced to longer wavelengths (or towards the red side 
of the spectrum) from the unperturbed line. In emission, they give rise in general 
to elliptically polarized radiation, which degenerates into circularly polarized ra- 
diation when observed along the direction of the magnetic field, and into linearly 
polarized radiation when observed in the plane perpendicular to the magnetic field. 
In particular, referring to Fig. 3.1, any a T component produces in emission: right- 
handed (or positive) circular polarization along the positive 2 -axis, left-handed (or 
negative) circular polarization along the negative 2 -axis, and linear polarization, 
perpendicular to the 2 -axis, along any direction in the x-y plane. 1 

The components having AM = +1 (cr b components) are generally displaced to 
shorter wavelengths (or towards the blue side of the spectrum) from the unper- 
turbed line. Their behavior is similar to that of the a T components except for the 
handedness of circular polarization. Referring again to Fig. 3.1, any a b component 
produces in emission: left-handed (or negative) circular polarization along the pos- 
itive 2 -axis, and right-handed (or positive) circular polarization along the negative 
2 -axis. In the x-y plane the cr b components have the same properties as the a T 
components. 

Finally, the wavelengths of the components having AM = 0 (the so-called n 
components) fall in between those of the a r and a b components. In emission, the 



In absorption, the situation is more complicated because the polarization of the radiation 
absorbed by the <r r components depends also on the polarization of the incident radiation. For 
the special case of an unpolarized incident beam, the absorbed radiation has the same polariza- 
tion characteristics as the radiation that would be emitted in the direction of the beam. As a 
consequence, the opposite polarization will be present in the beam after absorption. 
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Fig. 3.1. The polarization properties of the radiation emitted by different Zeeman components 
depend on the type of component and on the angle between the observing direction and the 
magnetic field vector. For the radiation emitted along the x, y, and z-axes, the tip of the wave 
electric field vector draws, in a fixed point, the figures here shown. 

7t components give rise to linear polarization whose direction is always parallel to 
the magnetic field, as represented schematically in Fig. 3.1. 

In particular, for an observer located in the x-y plane, all the components pro- 
duce linear polarization that is parallel (for the 7 r components) or perpendicular 
(for the a components) to the direction of the magnetic field. This explains the 
denominations 7r and cr, which follow from the German words parallel and senkrecht 
(perpendicular) . 

As far as the number of Zeeman components is concerned, the simplest case (often 
referred to as normal Zeeman effect) occurs when the angular momentum of any 
of the two levels involved in the transition is zero (J = 0 or J' = 0), or when both 
levels have the same Lande factor ( g = g'). 1 In both cases only three components 
are left: one a r component at the wavelength (A 0 + gAX B ), one cr b component at 
the wavelength (A 0 — gAX B ), and one n component at the wavelength A 0 , where 
the (/-factor is either the Lande factor of the level having J ^ 0, or the Lande factor 
common to the two levels. 2 



The two cases will be referred to in this book as normal Zeeman triplet and anomalous 
Zeeman triplet , respectively. Other authors reserve both names for the case (J = 0 or J' = 0), 
and use normal or anomalous according as the Lande factor is unity or not. 

The normal Zeeman effect (or, more properly, the normal Zeeman triplet case) can also 
be treated by a classical (non quantum- mechanical) approach, that will be presented in the next 
section. 
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TABLE 3.1 



Analytical expressions for the strengths of Zeeman components. J and M are the quantum 
numbers of the lower sublevels, J' and M' the quantum numbers of the upper sublevels. 





J' = J + 1 


j' = j 


1 

II 


a b 


M' = M + 1 


3(J+M+l)(J+M+2) 


3(J-M)(J+M+1) 


3(J-M)(J-M-1) 


2(J+l)(2J+l)(2J+3) 


2J(J+1)(2J+1) 


2J(2J-1)(2J+1) 


7 T 


M' = M 


3(J-M+1)(J+M+1) 


3 M 2 


3 (J-M)(J+M) 


(J+l)(2J+l)(2J+3) 


J(J+1)(2J+1) 


J(2J-1)(2J+1) 


<7 

r 


M' = M - 1 


3(J-M+l)(J-M+2) 


3(J+M)(J-M+1) 


3(J+M)(J+M-1) 




2J(J+1)(2J+1) 


2J(2J-i)(2J+l) 



In the other cases (referred to as anomalous Zeeman effect), more complicated 
patterns occur. It will be proved later (see Sect. 9.1) that the relative strengths of 
the various components are given by the expression 

S J /(M,M') = J m (q = —1, 0, +1) , (3.16) 

where the index q = — (M' — M) = — AM is equal to —1, 0, +1 for the cr b , 7 r, and 
cr r components, respectively. It can be easily seen that the relative strengths, as 
defined by Eq. (3.16), are normalized to unity; from Eq. (2.23a) we have in fact 

ES g JJ '( M , M ')=E 3 (-M' M -q ) 2 = 1 (<? = -!, °,+l). (3.17) 

MM' MM' ' ^ ' 



Using the analytical expressions of the 3 -j symbols (Eqs. (2.26)), the strengths of 
the various components can be written as algebraic functions of M and M'. The 
results are summarized in Table 3.1. 

From Eq. (3.16) one can prove some symmetry properties for the strengths. Equa- 
tions (2.24) and (2.25) give 



S J / ( M , M') = S J _ J q (— M, -M') (3.18) 

S J q J '(M , M') = S J q ' J {-M', —M) . (3.19) 



On the other hand, defining the wavelength displacement from line center via 
Eq. (3.12), 



we also have 



'MM' ~ ~A\ b ( g'M ' gM) , 


(3.20) 


A\JJ' — A\JJ' 


(3.21) 


1 

^ 1 
II 

T, % 

<1 


(3.22) 



Equations (3.18) and (3.21) show that for any cr r (<r b ) component there is a a h 
(cr r ) component of the same strength symmetrical about line center, and for any 
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7 r component (except the one corresponding to M = M' = 0, if it exists) there is 
another 7r component of the same strength symmetrical about line center. From 
Eqs. (3.19) and (3.22) we further deduce that the inversion of the upper and lower 
level in a given transition yields the same combination of strengths and splittings 
of the various components, or, in other words, the same Zeeman pattern; the 
component connecting, in the original pattern, the lower sublevel ( </, M ) with the 
upper sublevel ( J' , M') corresponds in the ‘inverted’ pattern to the component 
connecting the lower sublevel (J 7 , —M r ) with the upper sublevel (J, — M). 

Some representative Zeeman patterns are shown in Fig. 3.2. An extensive numeri- 
cal table of Zeeman patterns for a large number of electric-dipole atomic transitions 
has been given by Beckers (1969b). 1 



3.2. Classical Theory of the Zeeman Effect 



According to the classical theory of the electron, as developed by Lorentz in the 
early years of this century, we schematize the emitting atom as a negative electric 
charge (-e 0 ) oscillating, at the frequency v 0 , around a point P under the action of 
an elastic, restoring force. In the presence of a magnetic field B , the motion of the 
electron is described by the equation 



d 2 x 
c It 2 



. 22 -* e 0 dx — ~ 

= -n„ 0 x---xB- 7 -. 



cla? 

dt 



(3.23) 



where 7 is a constant which accounts for the damping of the electron due to its 
energy loss by radiation. 

The actual value of this constant, that is introduced here in a purely phenomeno- 
logical way, can be deduced by equating the work per unit time done on the electron 
by the friction force 



dL 

dt 



= mr y 




2 



with the energy loss by radiation per unit time 



W = 



2e 2 

ze o 

3 c 3 



d 2 x 

dt 2 



By averaging these quantities over an oscillation period, and disregarding the slight 
difference - due to the magnetic field between the actual oscillation frequency 
and u 0 , we have 



87 t 2 eg 2 8 tt / 7 reg\ 1 
3 me 3 3 \ me ) Ag 



(3.24) 



In Beckers’ table the Lande factors of the levels involved in the transitions are calculated 
according to the L-S coupling scheme (Eq. (3.8)). 
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Fig. 3. 2. Characteristic Zeeman patterns for different transitions. The Lande factors are computed 
according to the L-S coupling scheme. Following the usual convention, the n components are 
drawn upward and the cr components downward. The <7 b components in panels (c) and (f) are 
dashed for clarity; the a and <r b components at line center in panel (c) are drawn somewhat 
apart but they actually coincide. From left to right, top to bottom, we have patterns of Type 0 
(a), Type II (b), Type III (c,d), and Type I (e,f) (see Sect. 3.3 for the definition of Type). 
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To find the solution to Eq. (3.23) we introduce a real unit vector u 0 parallel to 
B and two real unit vectors u r and u s in the plane perpendicular to B , in such a 
way that (u r ,u s ,u 0 ) form, in this order, a right-handed coordinate system. From 
u r and u s we next define two complex unit vectors u ±1 by the equation 



Tti — ^ r 



,)• 



Expanding x on the basis u a (a = 0, ±1), 



(3.25) 



E 






and noting that 



we obtain for x a the following decoupled equations 



(3.26) 



Ek = -47r 2 ^a: a +AmavJ^± - 7 Es. (a = 0,±l), (3.27) 

where is the Larmor frequency defined in Eq. (3.10). If we now look for a 
solution of the form 

, — 27ri u t 

x a = A a e 

and observe that the quantities and 7 are generally much smaller than the 
frequency Pq , 1 we obtain 



= 4 



— 27 ri (i/ Q — <217 



)i ~%t 
e 1 



(3.28) 



where the constants A a are to be determined from the initial conditions and depend 
on the excitation mechanism of the classical dipole. 

To determine the radiation emitted by our classical model atom we have just to 
recall some important results from classical electrodynamics. It is well-known (see 
e.g. Jackson, 1962) that an oscillating, monochromatic dipole p(t) produces in the 
radiation zone an electromagnetic wave whose frequency is the same as that of the 
dipole, and whose electric field is described by the equation 

ikr , 2 i kr 

£ (r, O, t ) = k 2 - — x p (t)^ x fl = — - — P±(t) , (3.29) 

where k is the wavenumber, r is the distance from the dipole, 12 is a unit vector in 
the direction of propagation, and p ± = p— 12 (12 -p) is the component of the dipole 

For instance, for a spectral line at 5000 A and a magnetic field of 10 3 G, we have (see 
Eqs. (3.11) and (3.24)) 

z/ L ~ 1.4 x 10 9 s _1 , 7 ~ 8.9 x 10 7 s _1 , ~ 6.0 x 10 14 s -1 . 
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Fig. 3. 3. Geometry for calculating the polarization properties of the radiation emitted by a classical 
dipole in a magnetic field B. The unit vector e 1 is the reference direction for defining the Stokes 
parameters. 

in the plane perpendicular to f 1. From this equation it is possible to determine 
the polarization of the electromagnetic wave emitted by a classical dipole in any 
direction. 

To define the Stokes parameters (see Sect. 1.6) we choose, in the plane perpen- 
dicular to fi, the reference direction unit vector e\ and the associated unit vector 
e 2 , oriented as shown in Fig. 3.3. Since the oscillating dipole is p = —e 0 x, the 
components of the vector p ± along the directions e) and e 2 can be easily obtained 

P±i(t)=Pi(t) = ~e (i = 1,2), (3.30) 

a 

where the direction cosines C ai are defined by 1 

C ai = u a ■ e* . (3.31) 

We must consider, however, that the dipole oscillation is not monochromatic. By 
Fourier series expansion we get 



PM = -e 0 5] C ai 4, / e du , (3.32) 



Indeed, in this formula, the direction cosines could be just defined as C . = u a • e^, since 
the unit vectors e 1 and e 2 are real. However, Eq. (3.30) and the following ones up to Eq. (3.37) 
are valid also when the unit vectors e l5 e 2 are of the more general form of Eqs. (1.41). 
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where 



F a ( v) = e 0 L e 2 dt 
o 



i 1 

2ir (v 0 — au L — v) — \ r 



(3.33) 



with r = 7/47T. Thus the component of the electric held along the unit vector e i 
can be written in the form (as k = 2 tiv / c) 



£i(r,n,t) 



47r 2 e 

rc 2 



o 




o \{kr—2 , Ki't) . 

v e du 



and hence 



ere. 



167T 4 en 



Y, C L C 03 A « A 8 

a (3 



oo oo 

x j du J d u 1 F a {v)* F 0 {v 0 P 2 v’ 2 e l{k '- k)r . 



— oo — oo 



To obtain the polarization tensor we must take the average of this quantity over a 
time interval r sufficiently long as specified in Sect. 1.4. This leads to 

t/2 

,- 2 .i {V '-U)u = 1 Hm f e -2ni(,'-Ot dt= l 6{iy _ ul) 

' T t— oo J T 

-r/2 



with S(x) the Dirac delta-function, so that (see Eq. (1.25)) 1 

OO 

Ju A F a (u)*F fi (u)du. (3.34) 



J f r '| = 16 ^ 4e o V C* C 
J ijV) r 2 c 4 / j U ai 

Oi(3 



The quantities A a depend on the initial conditions for the motion of the oscillator. 
For random initial conditions (as one would have, for instance, if the oscillator 
were collisionally excited by a population of perturbing particles with isotropically 
distributed velocities), these quantities are uncorrelated with each other, so that, 
performing a statistical average over the initial conditions, we can replace A* a Ap 
by \A | 2 5 a p. On the other hand, the quantity |^1| 2 is related to the average energy 
E contained in each of the three degrees of freedom of the oscillator. In fact, since 



Note that this formula contains the normalization time r which is not related to any physical 
quantity. We will see shortly (Eq. (3.35)) how this quantity disappears from the final results. 
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for a classical oscillator the average kinetic energy is equal to the average potential 
energy, writing A a = \A\ e 1 ^ we have from Eq. (3.28) 

E = 47 t 2 Pq m [Re(x Q ,)] 2 = 47 t 2 Pq m |A| 2 cos 2 [27r(zz 0 — «p L )i — (f>\ e^ 7 * , 

and being the time interval t much larger than the decay time 7 -1 of the oscillator, 
we can write 






E = 47 t 2 Pq m - |A| 2 - lim /e d t = 

2 T T — KX> / 



- 7 t 27T 2 t'Q m 



o_21 TH2 



(3.35) 



Let us now introduce the emission coefficient in tensorial form, e i3 ( v, f l). Assum- 
ing N oscillators per unit volume, we have (see the analogous Eq. (1.31)) 



£■■ (p, fl) dp dtdQ = — J- • (r) IV df r 2 clfl , 
3 J 87 r 3 



(3.36) 



and using Eqs. (3.34) and (3.35) we get 



M"’ = 3 "0 7^ 53 ^ ^ \ F »\ 2 • 



which via Eq. (3.33) can be written in the form 



2 2 
TreS , T K 



SiAv, f2) = — - N -j E 53 C^i C aj - - 
13 K ’ me c 2 ^ m aj 71 - (z. 



- p) 2 + r 2 ' 



(3.37) 



From Eqs. (1.39), defining the profiles </> b , (f> , <j> T (normalized to 1 in frequency) as 



* K + - ^) 2 + r2 

1 r 



7 r (p 0 - p ) 2 + r 2 



r tt (p 0 - P L - p ) 2 + E 2 ’ 

and expressing the direction cosines for the geometry specified in Fig. 3.3, 



(3.38) 



C ±l 1 = — = ( i cos 0 cos <j) + i sin 9 sin <j>) 
V 2 

C 0 i = — sin# 

C ±1 2 = — (± sin (j) + i cos <(>) 
v 2 

Eq 2 = 0 , 



(3.39) 
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we finally obtain the emission coefficient in the four Stokes parameters 



£i(is,n) = + £ 22 (v,Q) = k- 



= e n (v, SI) - e 22 {v, Q) = k - 



1 

2 TP 
l r 



4> v shr# - 

’b 



(1 + cos 2 9) 



2 TP 



surf? 



£u{v, $1) = e 12 (v, Q) + e 21 (v, ft) = 0 
£ v (iy, ft) = i(e 12 (i', ft) - e 21 (v, ft)) = fc - 



cos 9 



(3.40) 



where 



k = 



net 



N ■ 



2Un -E. 



These formulae show that our collection of randomly excited classical model 
atoms emits along B (9 = 0) a radiation that is right-handed (or positively) circu- 
larly polarized around v = (p 0 — i/ L ) (in the ‘red’ component cr r ) and left-handed 
(or negatively) circularly polarized around v = (p 0 + p l ) (in the ‘blue’ component 
tJ b ). Obviously, the handedness of circular polarization changes when the direc- 
tion is reversed ( 6 = ^ r). For 9 = 7t/ 2, on the contrary, the emitted radiation is 
linearly polarized. The direction of linear polarization is parallel to the magnetic 
field in the it component and perpendicular to the magnetic field in the <r b and a r 
components. 

To sum up, we see that the classical theory provides a satisfactory explanation 
for the normal Zeeman effect with respect both to the wavelength separation of the 
Zeeman components and to their polarization characteristics. In the classical theory 
the frequency separation of the components is found to be p l , which corresponds 
to a Lande factor g = 1. This is quite obvious since (/-values other than 1 are 
produced by the spin (see Eq. (3.7)), that cannot be accounted for by a classical 
theory. 

We also want to remark that the results obtained for the polarization of the 
various components depend on the negative sign that we have assumed for the 
electric charge (see Eq. (3.23)). Assuming a positive charge for the electron, has 
to be replaced by in Eqs. (3.27) and following, and a further sign change must be 
performed in Eq. (3.30). This last change is irrelevant, as the Stokes parameters 
are quadratic functions of the dipole components, so that the net result is the 
exchange of (j) r and <^ b in Eqs. (3.40), which leads to the opposite sign for s v {v , fi). 
In other words, a sign inversion in the electron charge produces a sign inversion of 
circular polarization. This also means that by measuring the handedness of circular 
polarization in the various Zeeman components it is possible to determine the sign 
of the electron charge. 

This was just the procedure followed by Lorentz and Zeeman to give, for the 
first time, the correct negative sign for the electron charge. It is curious to note, 
however, that at first they deduced the wrong sign: hence the witty remark by Segre 
(1976) that ‘when signs are involved, even two Dutch physicists as scrupulous as 
Lorentz and Zeeman may make errors’. 
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3.3. Classification of Zeeman Patterns 



The first parameter that can be used to characterize Zeeman patterns is the so- 
called effective Lande factor g, 1 which represents the wavelength shift from line 
center of the ‘center of gravity’ of the cr r components in units of A\ B (sometimes 
also called Lorentz units). For an electric-dipole transition connecting a given level, 
having angular momentum and Lande factor g 1 , with another level of angular 
momentum J 2 and Lande factor g 2 , the value of g can be easily derived from 
Eqs. (3.12) and (3.16) 



9= E S J ff J ffM l: M 2 ) 
m x m 2 



\' f l' r 2 _ \ 

a m 1 m 2 a 

AXl 



o 



E 

M, M„ 



Jo 



J, 



-M 2 M x - 1 



(fliMi g 2 Mff 



(3.41) 



This expression is valid both when the index 1 refers to the lower level and the 
index 2 to the upper level, and in the opposite case. This is a consequence of the 
symmetry properties discussed at the end of Sect. 3.1, and reflects the fact that 
Zeeman patterns depend only on the quantum numbers of the levels involved in 
the transition, and not on which of them is the lower or the upper. 

Obviously, because of the symmetry properties just mentioned (Eqs. (3.18)- 
(3.22)), the center of gravity of the cr b components is -g, while the center of gravity 
of the 7 r components is 0. 

Equation (3.41) can be transformed into a simpler expression by means of some 
Racah algebra. First we note that, with the help of Eq. (2.26cl), we can write for 
M ± and M 2 



M, = /7,(J 1 + 1X2.7, + 1) (-1)'.-". (j£ \ J 
= 1 K 2X 2 + 1 )(— \ \ 



(3.42a) 

(3.42b) 



Next we observe that the term originating from M 1 in Eq. (3.41) can be written, 
using Eq. (2.42), in the form 



E 4 

Mi m 2 v 

- E (-D 



■A 

M 1 - 


E<- 


-l) J i 




( J i 


Ji 


J 


\ M x 


~M\ 


( J2 


■h 




\ M 2 


M[ 


-V 



■h Ji E = 
M 1 —M 1 0 J 

( h J i M 

\-M 2 M x -1 ) 



1 



The quantity g is often denoted by the symbol z in the literature concerning magnetic stars. 
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_ (_1) 1+J i + J 2 



1 

-1 



1 

1 




(3.43) 



Performing the same transformation on the other term of Eq. (3.41), and taking 
into account the analytical expressions of the 3 -j and 6 -j symbols (Eqs. (2.26d) 
and (2.36d)), we obtain 



9 = 2 (Si + 9 2 ) + 4 (Si - 92) d - 



(3.44) 



where 

d = J 1 {J 1 + 1) — </ 2 (J 2 + 1) • 

Note that this formula, originally due to Shenstone and Blair (1929), is invariant 
under interchange of the indices 1 and 2. In most cases, Eq. (3.44) is employed 
together with the Lande factors obtained from the L-S coupling scheme (Eq. (3.8)). 
The resulting g- value, that will be denoted in the following by g L g, can be easily 
calculated for any given electric-dipole transition (see e.g. Beckers, 1969b). A more 
appropriate value for g can be obtained by substituting in Eq. (3.44) the values g 1 
and g 2 deduced by spectroscopic measurements. In some cases, and especially for 
lines belonging to complex spectra and involving levels of high excitation potential, 
the L-S coupling scheme fails, so that the value g LS may be very different from g. 
Comparisons between g and g LS values for several lines of the Fel and Fell spectra 
have been published by Landi Degl’Innocenti (1982a) and by Solanki and Stenflo 
(1985). 

Typical g values range from 0.5 to 2.5, but, for some transitions, g can attain 
null or even negative values, while for other transitions it can be larger than 2.5. A 
list of transitions characterized by such ‘anomalous’ values of the effective Lande 
factor is presented in Table 3.2. The list includes all the transitions between L-S 
levels having S <7/2 and L < 5, and subjected to the following limitations 



AJ = 0, ±1 , AS = 0, ±1 , AL = 0, ±1, ±2 . 



Lines having either exceptionally large values or null values for their effective Lande 
factor are of great importance for the study of solar (and stellar) magnetism. Obvi- 
ously, lines with large g values produce strong polarization signals, and are partic- 
ularly useful for the detection and measurement of magnetic fields. On the other 
hand, lines with g = 0 are especially useful for velocity measurements and for 
calibration. A reduced list of lines with large (j- values has been given by Harvey 
(1973), while Sistla and Harvey (1970) have presented a list of g = 0 lines. A table 
of close line pairs (less than 100 A apart), with one line having a large g value and 
the other a small (or null) g value, has been presented by Robinson (1980). 

In some studies of stellar magnetism, it can be useful to compute an average 
value for the effective Lande factor over the lines contained in a given statistical 
sample or in an assigned spectral interval. For instance, considering the sample 
of approximately 400 unblended lines of the Fel solar spectrum given by Stenflo 
and Lindegren (1977), if the effective Lande factor of each line is weighted by the 
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TABLE 3.2 



Transitions having ‘anomalous’ values for the effective Lande factor g LS - The transitions listed 
are those obeying the restrictions specified in the text. 
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central depression observed for the same line, one finds the value (g BB ) = 1-22 
(Landi Degl’Innocenti, 1985a). 1 

Besides the effective Lande factor, higher order moments can be introduced to 
characterize Zeeman patterns. For each type of component we define the barycen- 
tric n-th order moment Vq to be 

E S^{M 1i M 2 )\a\ J ^ i J I i -qgAxS , (3.45) 

m,m 2 

where AX Em an d Sq lJ2 (M 1 ,M 2 ) are defined in Eqs. (3.20) and (3.16), and where 
q = — 1,0, +1 for the cr b , 7 r, and cr r components, respectively. 

Using the symmetry properties derived in Sect. 3.1 (Eqs. (3.18)-(3.22)), and 
recalling that g is invariant under interchange of J x and J 2 , one easily gets 

(3-46) 

2 ) = (-1 ■ (3.47) 

Equation (3.46) shows that the various moments are not altered by the interchange 
of the lower and the upper level - an obvious consequence of the same property 
holding for the Zeeman patterns. Equation (3.47) shows that the odd-order mo- 
ments of the 7r components identically vanish, while, for the cr components, the 
even-order moments of the cr b and cr T components are the same, and the odcl-order 
moments have opposite sign. Obviously, all the even-order moments are positive 
quantities. 

We will now show that the barycentric moments defined in Eq. (3.45) can be 
reduced to a simple form which contains 3 -j and 6 -j symbols only. Substitution of 
Eqs. (3.20) and (3.44) into Eq. (3.45) leads after some algebra to the expression 

v { q n \^J 2 ) = A\ n B ( 9l - g 2 ) n *< n) (Ji, J 2 ) , (3.48) 

with 

X^\J 11 J 2 )= E Sq lJ *(M x ,M 2 ) 

m x m 2 L j 

= ± (”) ( d -A «)"" Zf'Oi, A) • (3.49) 

where Zq l \j x , J 2 ) is the i-th moment of M x weighted by the strength of the q- 
transition originating from M 1 , 

Z^\J 1 ,J 2 )= E MiS J q ^(M 1: M 2 ). (3.50) 

MiM 2 

Owing both to the L-S assumption and to the procedure itself by which it is obtained, this 
value has only an approximate and statistical meaning. 
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The quantities zip (,J l . J 2 ) can be evaluated by some Racah algebra. For this 
purpose we note that the ct-component of an angular momentum a can be written 
in the form (see Eqs. (3.42)) 



a = (— l) a-a \/ a(a + l)(2a + 1) 




a 

—a 




(3.51) 



Squaring this equation we get 



a 



2 



a(a T l)(2a T l)j 




a 

—a 



l\ fa a l\ 
OJ \a - a OJ ’ 



or, with the help of Eq. (2.43) 



a 2 = (-l) a+ “ [a(a + l)(2a + 1)] 



x^(2/ + l) 
/ 



1 1 / 

a a a 



1 f\(a a f\ 

0 0 J -a 0 J ’ 



(3.52) 



where the sum over / runs over the values 0 and 2 only (for f = 1 the first 3 -j 
symbol in the right-hand side is zero). 

Repeated application of the same procedure gives the following remarkable for- 
mula 



= (-l)( 2n - 3)o+a [o(o + l)(2o + 1)] 



n/2 



x ^ (2f 1 + l)(2/ 2 + 1) • • • (2f n _ 1 + 1) 



a a a 



0 ° 0 / 

a a /„_! 
a — a 0 



l/ 1 


fl 




J 1° 


a 


a J 


f 1 


h 


f 2 ) 


l 0 


0 


0 J 



1 / n—2 f n—l 



1 f n—2 f n— 1 
0 0 0 



(n = 2,3,...) • 



Owing to the property (2.24) of the 3 -j symbols, the indices f lt f 2 ,. . . are alternately 
even and odd numbers. Moreover, they are chained in such a way that the allowed 
values for f i+1 are (f i ± 1), starting from f 1 = 0 and = 2. For example, if n = 4 
the chains of allowed values for f \ , / 2 , and / 3 are: (0,1,0), (0,1,2), (2,1,0), (2,1,2), 
(2,3,2), (2,3,4). 

We now use the above expansion to express the quantity M{ in Eq. (3.50), and 
substitute the explicit expression for the strengths given by Eq. (3.16). Performing 
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the sum over M l and M 2 by the same method that leads to Eq. (3.43), we obtain, 
for * > 2 



= (-l)( 2i - 3 ) J i- J 2+9 3 + 1)(2J 1 + l)] i/2 



x £ ( 2/ 1 + 1 )( 2/ 2 + 1 )---( 2/ j _ 1 + 1 ) 

f i-l 



X 



x 



X 



( 1 1 A 

\Jl J, Jj 



f 1 1 fi- 

\—q q o 




i A 

0 0 




/l h \ /I /i-2 fi—1 \ 

■h M "U Ji -/i J 



/ 2 

0 



1 /i-2 

0 0 



fi-1 

0 



1 /i-ll 
•A ^2 J ’ 



while for i = 0 and * = 1 we have 



Zf(J 1 ,J 2 ) = l 

J 2 ) = (-l) J ' +J * +q 3 V /J 1 (J 1 + 1)(2J 1 + 1) 



( 1 1 




/ i 


i 


1 \ 


\-q q 


o) 




■h 


■hi 



These formulae, together with Eqs. (3.49) and (3.48), give the required expres- 
sions for the barycentric moments. Using Eqs. (2.26) and (2.36) one can indeed 
obtain simple analytical formulae for the low-order moments. The results, that 
are contained in Table 3.3, show the characteristic symmetrical behavior of the 
quantities X g n \j 1 , J 2 ) under interchange of J 1 with J 2 and of q with —q. 

Besides the barycentric moments of the components, it is also convenient to 
introduce the moments tout court. These moments are defined by 



w 



(n) 



(Jl,J 2 ) = 



E 

M, M r 



Sq lJ2 (M 1 , M 2 ) 



\A\ 



T A 

M, 



(3.53) 



and it can be easily seen that they satisfy the same symmetry properties found for 
the barycentric moments (Eqs. (3.46) and (3.47)). 

The relations between the two types of moments are the following 

= E (- i r _ i f nN ) 

i=0 

™ ( g n) ( J i,J 2 ) = E (?) [qg^B]"-* • 

z=0 w 
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Analytical expressions for the low-order barycentric moments of Zeeman components. (*) The 
expressions have been simplified taking into account that, for electric dipole transitions, the 
quantity [d(2s — d?)] is identically zero. 




w^\J 1 ,J 2 ) = AX n B G^(J 1 ,J 2 ), (3.54) 



where 

j 2 ) = it( n ) (fli - 9 2 y (92 . (3.55) 

i—0 ' 1 ' 

Analytical formulae for the quantities Gq' l> (,/, . J 2 ) up to n = 4 are contained in 
Table 3.4. 

An alternative derivation of the quantities Vq" ' 1 and Wq ^ in terms of Bernoulli 
polynomials has been given by Mathys and Stenflo (1987a, b), who employ the 
notations and n! (— AX B ) n Cn \ respectively. In their second paper (1987b), 
these authors give also extensive numerical tables for and Cn these last 
quantities are however evaluated, for any given transition, by assuming for the 
Lande factors the values deduced from the L-S coupling scheme. 

Zeeman patterns can be classified into four different types according to the value 
of their barycentric moments, with particular emphasis on the third-order mo- 
ment v{ 3 \ 

First of all, we will call Type 0 the normal and anomalous Zeeman triplets (see 
Sect. 3.1): obviously, these patterns are characterized by Vq 1 ' 1 = 0 for n > 1. A 
complete list of transitions giving rise to Type 0 patterns has been published by 
Mathys and Stenflo (1987b). 
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TABLE 3.4 

Analytical expressions for the low-order moments of Zeeman components G) 

G™ = 1 

G« = 0 

G ±l = ±(| 9 S + \ d 9 d ) = ±9 

G o 2) = TO (3 s~d 2 - 2 )g\ 

G ±l = M 2 + 2s “ %d + + 5 9?] 

G ( 0 3) = 0 

G ±l = ± m [( 3d2 ~ 7 ^ d 9d + 6 ( d2 + 2s - 3 )9d9s + + 10 9?] 

G 0 4) = T3o (“ 3d4 + 7d2 + 156,2 - 30s + 20 )9 d 

G ±l = sfe [(“ 4 - 14rf2 + 12s2 - 52s + 51 )9d + 14 ( 3d2 - 7 ) d 9 d 9 s + 42 (d 2 + 2 s - 3)g2 fl 2 

+70d 9d9 3 + 35^] 

^ ^ s = 4~ 1) 4- J 2 ( d 2 4" 4)] , d = + 1) — 4" 4)] 5 — ($1 4" H 2 ) ’ = (9 1 ~ 92 ) 

For anomalous Zeeman patterns, we will call, following Back and Lande (1925), 
Type I those having u) 3) > 0, Type II those having t>) 3) < 0, and Type III those 
with v) 3) =0. As it can be argued from the definition of the barycentric moments 
(Eq. (3.45)), patterns of Type I have their strongest a r component toward the blue 
side of the spectrum, while patterns of Type II have their strongest a r component 
toward the red side of the spectrum. Finally, Type III patterns have a components 
symmetrical about their center of gravity. Some examples of Zeeman patterns of 
different types are shown in Fig. 3.2. 

From the analytical expressions given in Table 3.3 and from Eq. (3.48) we have 
<4 3) = Ax b (Si - 3 2 ) 3 d (4 - d 2 ) . 

This formula shows that Type III patterns can only originate when d = 0, or, in 
other words, when J l = J 2 . By contrast, Type I and Type II patterns correspond 
to transitions having AJ = ±1. In particular, Type I patterns arise when the 
Lande factor of the level having the smaller J is larger than the Lande factor of the 
level having the larger J, while the opposite situation leads to Type II patterns. 

It should be remarked that the classification of Zeeman patterns, although based 
on the <7 components, characterizes at the same time the behavior of the tt com- 
ponents. Indeed, with increasing distance from line center the strengths of the 7 r 
components decrease (quadratically) for Type I and Type II patterns, while they 
increase (also quadratically) for Type III patterns. This is apparent from Table 3.1 
and Eq. (3.20). 
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Fig. 3. 4. Definition of the minimal equivalent Zeeman pattern. 



It should also be mentioned that a useful concept for certain applications is that 
of minimal equivalent Zeeman pattern. Given an arbitrary Zeeman pattern, we 
define its minimal equivalent as the pattern having only two 7r components and 
two <7 components (both for the blue and the red wing) with such splittings and 
strengths that the barycentric moments up to n = 3 of the original pattern are 
reproduced. The minimal equivalent Zeeman pattern is sketched in Fig. 3.4. It can 
be shown that the strengths S i and the splittings z i (expressed in units of A\ B , 
and satisfying the condition z 2 > z x ) are given by 




where 

n Y ^ 

= _Sd_ A i 

1 9d\ [X< 2) ] 3/2 ' 

Note that the value of z remains undefined when X[ 2> = 0; this is a degenerate 
case corresponding either to a Zeeman triplet (Type 0 pattern) or to the transition 
1/2 — 1/2. In both cases the two a components merge into a single one with 
z i ~ z i = 9i (‘-’l ~b ‘-’ 2 ) = !• 



3.4. The Paschen-Back Effect 

As already stated in Sect. 3.1, when the magnetic field is so strong as to produce 
on a J-level a splitting comparable with the energy separation between different 
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J-levels, the perturbation theory presented for the Zeeman effect cannot be applied 
any longer. In this regime, that is called the Paschen-Back effect regime, the energy 
levels have to be found by diagonalization of the total Hamiltonian H given by 

h = h 0 + h b , 

where H 0 and H B are the unperturbed and magnetic Hamiltonians, respectively, 
with H b given by Eq. (3.3). 

The diagonalization of the Hamiltonian H can be performed on the basis of the 
eigenvectors | aJM) of H 0 (see Eqs. (3.4)). This implies the evaluation of matrix 
elements of the form 

(aJM\H B \a'J'M') = /z 0 <aJM|(L + 25) • B\a'J'M') . 

A calculation similar to that of Sect. 3.1 leads to the expression 

(aJM\H B \a'J'M'y = n 0 B ^ (-l) J ' +M+q+1 y/2J + 1 

Q 

x 'y/j(j + 1) Sjj, + (aJ\\S IK J'>] , 

and, in particular, when the quantization axis for J is in the magnetic field direction 

(aJM\H B \a'J'M'} = 

= PqH M 6 aa , Sjj, 

+ { _ 1) J'+m+i V 2JTi (_ J m J m (aJWSWa'J')} S MM , . (3.56) 

Equation (3.56) shows that the magnetic Hamiltonian H B connects only eigenstates 
having the same value of the magnetic quantum number M: in other words, it is 
block-diagonal, each block being characterized by an assigned M-value. 

The general problem of finding the eigenvalues and the eigenvectors of the matrix 
H can be solved only by numerical methods. There are, however, some special cases 
where the analytical calculations can be pushed somewhat further. If we suppose, 
for instance, that the atomic system is exactly described by the L-S coupling 
scheme, the eigenstates of H 0 are of the form \/3LSJMy (where /3 summarizes the 
electronic configuration quantum numbers), so that Eq. (3.56) becomes 

(/3LSJM\H b \j3’L' S' J'M'y = 

= H 0 B M Spp, S LL , 5 SS , Sjj, 

+ (-1) j ' +m+1 V2JTT J m J) mSJ\\S\\p'L'S'J'y] S MM , , 



( J J ’ ^ 
\-M M' qj 
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and using Eqs. (2.109) and (2.97) 

(0LSJM\H b \(3 L S J M ) = n 0 B 5pp, S LL , S ss , 5 MM , 

x MSjj, + {-1) j+jI+l+s+m ^(2J + 1)(2 J' + 1)5(5 + 1)(2S + 1) 



J 


J 1 




[ J 


J ' 


ni 


—M 


M 


oj 


\S 


s 





(3.57) 



From this expression we obtain the important result that in the L-S coupling 
scheme the magnetic Hamiltonian is block-diagonal, each block being now charac- 
terized by the magnetic quantum number M and by the quantum numbers (/3, L 1 S) 
specifying the term. In other words, the magnetic field produces a J-mixing of the 
various levels belonging to a particular term, so that the eigenvectors of the total 
Hamiltonian will be of the form 

| (3LSjM) = Y C J j{pLS, M ) \/3LSJM) , (3.58) 

j 



where the index j labels the different states of the iV-fold degenerate subspace 
corresponding to assigned values of (/?, L, S, M); N is given by 

N = l + L + S — max(|M |, | L — jS|) . 

Of course, in contrast with the Zeeman effect regime, J is not a good quantum 
number. 

We now recall that in the L-S coupling scheme the energies E^ LS {J) of the 
different J-levels originating from a given term (/?, L , S) are determined by the 
spin-orbit Hamiltonian H so (included in H 0 ), which in most cases can be written 
in the form (see Condon and Slrortley, 1935) 

H so = CC QLS) L-S , (3.59) 

where £ is a constant having the dimensions of energy. Writing L-S in the form 

L-S =\[J 2 ~L 2 -S 2 ] , 

we have 1 

Efs LS (J) = <(3LSJM\H so \(3LSJM > 

= i C (PLS) [J(J + 1) - L(L + 1) - S(S + 1)] . (3.60) 

Note in passing that the energy difference between two adjacent J-levels is given by 
EfiLsV ) - E (3ls( J - !) = C(PLS) J , 
which is the well-known Lande interval rule. 
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Thus the whole problem of finding the energy levels of an atom described by the 
L-S coupling scheme, embedded in a magnetic field, reduces to the diagonalization 
of a set of matrices of the form 

(/3LSJM\H so + H B \/3LSJ'M) . 

From Eq. (3.57), using the analytical expressions of the 3 -j and 6 -j symbols 
(Eqs. (2.26) and (2.36)), we have that the only non-zero matrix elements are those 
of the form 

(PLSJM\H so + H b \0LSJM > = E 0LS (J) + g 0 B g LS {J) M (3.61a) 

(j3LS J-l M\H so + H b \0LSJM) = </ 3LSJM\H so + H B \/3LS J - 1 M> 

l-i^B j (T+S'+T+l) (T — S-\-L ) (</ -\-S — L ) ( — J +S , +T-I-1) (T 2 — A7 2 ) , 

= 2J~ y (2J+1)(2J— 1) ’ (3 ' 61b) 

where g^giJ) is the Lande factor defined in Eq. (3.8). 

Diagonalization of the matrix (3.61) (which is seen to be tridiagonal, real, and 
symmetric) gives the energy eigenvalues A j((3LS, M) and the corresponding eigen- 
vectors expressed in terms of the coefficients Cj(/3LS, M) of Eq. (3.58). Since the 
matrix is real and symmetric, the coefficients Cj can be chosen to be real. More- 
over, from general theorems concerning the diagonalization of matrices one can 
prove the relations 

Cj((3LS, M) C 3 j 0 3LS , M) = S jjf (3.62a) 

J 

C 3 A/3LS, M ) C’j, ( 0LS , M) = Sjj, (3.62b) 

3 

C 3 j(/3LS, M ) C 3 j, (/ 3LS , M)X j {(3LS : M) = 

3 

= (pLSJM\H ao + H B \/3LSJ'My . (3.62c) 

The diagonalization of matrix (3.61) can be performed analytically for doublet 
terms only ( S = 1/2). The energy eigenvalues are found to be 

Ai = - L C + L _|_ i M 
for M = ±(L + 1/2), and 

^1,2 = C + Mo BM ± - 
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Fig. 3. 5. The energy levels of the term 2 P plotted as functions of the magnetic field strength. The 
energy E (vertical scale) and the magnetic energy // () (horizontal scale) are both normalized to 
the fine-structure energy (supposed positive). For an inverted multiplet (£ < 0) the graph is 
simply reversed about the E = 0 axis. 



for M ±(L + 1/2). In all the other cases, the diagonalization can be performed 
only by numerical methods. 

Figures 3.5 to 3.9 illustrate the behavior of the energy eigenvalues of the most 
common L-S terms as a function of the magnetic field strength, parameterized 
through the quantity 7 defined by 



7 = ^ 



(3.63) 



For 7 <C 1 the eigenvalues spread out linearly from their degenerate, zero-magnetic 
field value (Zeeman effect regime). For intermediate values of 7, the eigenvalues 
start crossing each other and the linearity with the magnetic field is, in general, lost 
(■ incomplete Paschen-Back effect regime ); finally, for large values of 7 (7 1), the 

eigenvalues behave again linearly with 7, as shown in Fig. 3.10 ( complete Paschen- 
Back effect regime). 

In this last regime, the spin-orbit interaction can be considered as a perturbation 
in comparison with the magnetic interaction. On the other hand, the magnetic 
Hamiltonian is diagonal on the basis \fiLSM f M s y , and the eigenvalues are given 
by 

(PLSM L M s \H B \(3LSM L M s y = (M L + 2 M s ) , 

which explains their linear behavior with B for 7 1. As the magnetic held 

increases from 7 <C 1 to 7 > 1 , the energy eigenvectors gradually evolve from 
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Fig. 3. 10. For large values of the magnetic field, the energy eigenvalues spread out linearly. This 
figure is just an enlargement of Fig. 3. 5 and is intended to show the behavior of the eigenvalues 
for 7 1 . 



the form \(3LSJM') to the form \j3LSM L M s '} . In other words, the magnetic held 
induces a gradual basis-transformation on the energy eigenstates. 

As apparent from Figs. 3. 5-3. 9, the incomplete Paschen-Back effect regime is 
characterized by several ‘crossings’ of the energy eigenvalues, each corresponding 
to a well-defined value of the 7 parameter. It will be shown in Sect. 10.18 that 
such level-crossings induce important phenomena in resonance scattering. Table 3.5 
gives, for the most common L-S terms, the values of 7 relative to each level-crossing, 
and the corresponding M-values of the levels involved. To illustrate how this table 
should be used, consider the 3p 2 P term of Nal: from the tables of Moore (1949) 
we get a value for the constant ( equal to 11.4642 cm -1 , whence we deduce that 
level-crossing takes place for B = 1.64 x 10 5 G and for B = 2.46 x 10 5 G. 

In some cases, and particularly for light atoms such as He, the spin-orbit inter- 
action cannot be described by the simple expression given in Eq. (3.59). In these 
cases the energy levels in the presence of a magnetic field can still be obtained by 
diagonalizing the matrix given in Eqs. (3.61), but the actual values E^ LS (J) are 
now to be found in tables of atomic energy levels (see e.g. Moore, 1949, 1952, 1958) 
or in the specialized literature. 

Let’s now consider how the splitting of the energy levels due to the magnetic 
field affects, in the general case of the Paschen-Back regime, the spectral lines 
originating in the transition between two given terms. The main properties of the 
Paschen-Back patterns resulting from the electric-dipole interaction are described 
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TABLE 3.5 

Level-crossings for the most common L-S terms. The magnetic field is expressed through the 
parameter 7 defined in the text. The M-values refer to a normal multiplet (f > 0). For inverted 
multiplets they must be changed in sign. 
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below. 

First of all, since the projection M of the total angular momentum on the mag- 
netic field direction is a good quantum number, the individual components can be 
characterized by the value AM = (M' — M), where M refers to the lower level and 
M' to the upper one. Owing to the selection rule (3.15) we thus obtain that - in 
strict analogy with the Zeeman effect - the components can be divided into three 
distinct groups: a r ( AM = —1), n (AM = 0), and a h (AM = +1). 

As far as the strengths of the components are concerned, it can be shown that 
they are proportional to the quantities 

s jM,j' M ' = | ^LSjM\r q \f5'L'S'j'M'y | 2 (q = -1, 0, +1) , 

where the unprimed quantum numbers refer to the lower level, the primed ones to 
the upper level, and where r q are the spherical components of the position operator 
of the optical electrons. 

Expanding the eigenvectors as in Eq. (3.58), and taking into account that the 
coefficients Cj are real, one obtains 

S 3M,j'M' = C J j(/3LS, M) C j j„ ( (3LS , M) C j j, pL'S', M') C j j„, pL'S', M') 

x (/3LSJM\r q \/3'L'S'J'M'y(/3LSJ"M\r q \/3'L'S'J'"M'y . 

Evaluating the matrix elements through the Wigner-Eckart theorem (Eq. (2.96)), 
and noting that r is an operator which acts only on the orbital part of the eigen- 
vector, one gets from Eq. (2.108) 



gjM,j'M' _ 

= C J 0 s LS , M ) C j j" (PLS, M) C J j, (pL'S, M') Cj,„ pL'S , M') 

x (2 L + 1)^(2 J + 1)(2 J> + 1)(2 J" + 1)(2 J'" + 1) 



r l l' i 
\J' J s 



L L' 1\ 

J'" J" S J 



( ./' J l W J"’ J" 
^ -M’ M -q) —M' M 



\ (PL\\r\pL'}\ 2 S ss , 



a formula that contains all the relevant selection rules, namely 



AL = 0, ±1 L = 0^L' = 0 
AS = 0 
AM = 0,±1 . 
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Summation over all the possible transitions gives (see Eq. (3.62b)) 

E s j q M ' j ' M ' = e ( 2L + l )( 2j + 1 )( 2 ' 7 ' + !) 

jj' MM' J J' MM' 

x(_M’ M _ 1 ,)" |<,Jt|W ' t '> |2 - 

which can be reduced, using Eqs. (2.23a) and (2.39), to 

E S i M ' j ' M ’ = l^L+l)(2S+l)\(PL\\f\\P'L')\ 2 . 

jj' MM' 

This property makes it possible to define the normalized strengths in the form 



L L’ 1 
J’ J S 



gjM,j'M' _ 

= V ;T7r — r Cj (PLS, M) C j j„ ( (iLS , M) C j j, (/ 3' L'S , M') cj,„ {(3' L'S, M') 

Zis) 1 



with 



x s/(2J + 1)(2J' + 1)(2 J" + 1)(2J"' + 1) 



V 




f L L' 




J 


4 


\ ./'" J" 


4 



( ./' J 1 \ / J"' J" 1 \ 
\-M' M -q)\-M' M -q J ’ 



E = i (g = -i,o, +i) . 

jj' MM' 



(3.64) 



The strengths as defined by Eq. (3.64) have some important properties: 

a) In the absence of magnetic fields, J is a good quantum number, and the eigenvec- 
tors |/3 LSjMy converge towards the states | j3LSJM)> which are degenerate with 
respect to M. For the coefficients Cj we have 



Cj{f3LS, M) = S.j, 



so that 

gJM,j'M' 



3 

2S + 1 



(2J + 1)(2J' + 1) 



[ L 


L' 


n 2 / 


j’ 


J 


\ J’ 


J 


4 1 


—M' 


M 




and summing over the degeneracy parameters M and M' we obtain the usual 
expression for the relative strengths of a fine-structure multiplet 



S J ' f = 






^ S q 
MM' 



jm,j'm’ _ (2 J + 1)(2 J' + 1) 



2S+1 



L 

J' 



V 

J 



(3.65) 
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b) The centers of gravity in frequency of the 7r, a T , and er b components ( q = 0, +1, 
and —1, respectively) are linear functions of the magnetic field 

Avq = -qvh, (3.66) 

where is the Larmor frequency and Av q is defined by 

Av q = E I (3-67) 

jj' MM' 

with Au^ im , the frequency shift of the component corresponding to the transition 
between the lower state \PLSjM} and the upper state | (3'L'Sj'M'y from the 
unperturbed frequency of the transition between the two terms (/ 3LS ) and (/3 1 L' S). 

To prove Eq. (3.66) we substitute into Eq. (3.67) the strengths of Eq. (3.64) and 
the frequency shifts 



Au n 



(fl'L'S, M') — X j (/3LS, M) 
h 



(3.68) 



where Xj, Xj, are the eigenvalues of the matrix (3.61) for the lower and the upper 
term, respectively. We then perform the sums over j and j' using Eqs. (3.62b) and 
(3.62c), to obtain 



E E 



MM’ 



2 5- 



- V(2 j + 1)(2 J' + 1)(2 J" + 1)(2 J'" + 1) 



L U 1 



LI! 1 



J' J 5[1 J"' J" S 



J' J l 



J'" J" 1 



-M' M - q)\-M ' M -q t 
Sjj,, ((3'L'SJ'M'\H so + H B \p'L'SJ'"M'y 
- (pLSJM\H so + H b \0LS.J"M) 



(3.69) 



It is easy to prove that the spin-orbit Hamiltonian gives zero contribution. Indeed, 
noting that this Hamiltonian is diagonal with respect to the quantum numbers J 
and M, and that its matrix elements are independent of M, and performing the 
sums over M and M' via Eq. (2.23a), we have for this contribution 



/ \ (2J + 1)(2 J' + 1) ( L L' 1 ) 

[ o hjy, (25+1) \J' J SJ 



x 



(/3'L'SJ'M'\H so | P'L'SJ’M' > 



(pLSJM\H so \pLSJM) 
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Summation over J for the first term, and over J' for the second term gives, with 
the use of Eq. (2.39) 



(^Jso h 



E for/ 



2./' + 1 



(2L' + 1)(2S + 1) 



E 



2J + 1 



(2 L + 1)(2S + 1) 



(l3'L'SJ'M'\H so \(3'L'SJ'M'y 



(/3LSJM\H so \/3LSJM) 



The first sum is simply the average energy of the fine-structure levels of the upper 
term, while the second sum is the same quantity for the lower term. It can be 
easily proved that both these quantities are identically zero when the spin-orbit 
Hamiltonian is described by Eq. (3.59). 1 

The only non- vanishing contribution to Eq. (3.69) is due to the magnetic field. To 
evaluate this contribution, we substitute the matrix elements given by Eq. (3.57) 
and perform the sums over M and M' by means of Eq. (2.42). After some tedious 
Racalr-algebra calculations which involve the use of Eqs. (2.38), (2.48), (2.55), and 
(3.51), we obtain that only the first term in the square bracket of the matrix 
element (3.57) gives a non-zero contribution, and we get the expression 



. 3 Pr 

Uq = 25+1 



J J' 



f L 


L' 


1 f 2 


(1 


1 


u 


J 


s\ 




o 

1 



\J J'{.V + 1)(2./' + 1) j J, j, 



+ \J J(J + 1)(2J + 1) 



111 
J J J' 



Evaluating the 6 -j symbols in the square bracket by Eq. (2.36d) and the 3 -j symbol 
by Eq. (2.26d), and performing the sums over J and J' via Eq. (2.39), we finally 
obtain Eq. (3.66). 

The theorem just proved shows an interesting property of the Paschen-Back effect 
which can be summarized by the following statement: The frequency shifts of the 
centers of gravity of the a r , er b , and tt components in the Paschen-Back effect regime 
are those typical of the normal Zeeman effect between two spinless levels. In other 
words, given a fine-structured lower term (/ 3LS ) and a fine-structured upper term 



To prove this statement, one has just to recall the sum rules for integers 



EL i* = 



in(n + l), ELL 2 = gn(«+l)(2n + l), ELi fe3 = \ " 2 (" + 1 ) 2 



If, on the other hand, the spin-orbit Hamiltonian cannot be written in the form (3.59), the quantity 
(^ 1 / ) so may indeed differ from zero. In such cases, it represents a constant shift (independent 
of q ) that affects the unperturbed frequency of the transition. In other words, Eq. (3.66) still 
holds provided Eq. (3.68) is replaced by 



Av n = - 
MM' ^ 



M')-\ j ((3LS , M) 



- ( Au q) SO' 
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( (3'L'S ), the centers of gravity of the cr r , er b , and tt components have the same 
frequencies as the individual components that would originate from the transition 
between a spinless lower level (/ 3L ) and a spinless upper level (/ 3'L '), both split 
according to the normal Zeeman effect. This result is what should be expected 
from the principle of spectroscopic stability, as it can be inferred from considering 
the limit of negligible fine structure. 



3.5. Magnetic Field and Hyperfine Structure 



It is well-known that hyperfine structure results from the interaction of the elec- 
tronic cloud with the atomic nucleus. The energy involved in this interaction is 
approximately one thousand times smaller than the energy relative to the spin- 
orbit interaction, so that the hyperfine-structure components of a spectral line 
have typical wavelength separations of a few mA. 

In most cases (and particularly for lines originating from astrophysical plasmas), 
these components remain unresolved, because their separation is much smaller than 
the line width due to different broadening mechanisms. For this reason the hyper- 
fine structure of spectral lines is often neglected, especially in usual astrophysical 
applications. 

However, when polarization phenomena are involved, the situation is substan- 
tially different, and hyperfine structure generally plays an important role (see 
Sect. 9.23). We describe here the effect of a magnetic field on a hyperhne-structured 
line, in order to state some basic properties of the resulting pattern. 

We consider a particular isotope having nuclear spin /, and introduce the rep- 
resentation | aJIF/y to describe its energy eigenvectors. In this notation, apart 
from the symbols already used, F is the total angular momentum (electronic plus 
nuclear: F = J + I) quantum number, while / represents its projection on the 
z-axis of the coordinate system (/ = —F, —F + 1, . . . , F). 

The hyperfine-structure interaction energy can be expressed as an infinite series 
of electric and magnetic multipoles (see for instance Kopfermann, 1958). In most 
cases, a very good approximation to the observed energy intervals is obtained by 
retaining only the first two terms in the multipole series; these are the magnetic- 
dipole and electric-quadrupole terms, which are given by 

{aJIFf\H^ s \aJIF’fy = S FF ,S ff , K 



(aJIFf\H™\aJIF’f) 



bppi S f ^ H(ct, J, I) 



K(K+1)--J(J + !)/(/+!) 



(3.70) 



where A(a, J, I) and B(a, J , I) are the magnetic- dipole and electric-quadrupole hy- 
perfine structure constants relative to a given atomic level of a given isotope, while 



l 



A formulation of this principle is given in Sect. 7.10. 
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K is defined by 

K = F(F + 1) — J(J + 1) — I (I + 1) • (3.71) 

An extensive table of experimental A and B values can be found in Brix and 
Kopfermann (1952). From such values, and from Eqs. (3.70), the energy of the 
various hyperfine-structure sublevels can be easily computed. The sublevels are 
characterized by the total angular momentum F and are degenerate with respect 
to /. 

If a magnetic field is present, so weak as to produce a splitting much smaller 
than the energy differences between the various hyperfine-structure sublevels, its 
effect can be deduced from perturbation theory, by diagonalizing the magnetic 
Hamiltonian H B of Eq. (3.3) on the degenerate basis | aJIFff. 1 Aligning the 
z-axis of the reference system with the direction of the magnetic held, the matrix 
elements to be evaluated are of the form 



(aJIFf\g 0 B(L z + 2S z )\aJIFf) . 

Let us evaluate the more general matrix element (that will be needed later) 
between two states having different F-values. Writing (L z + 2 S z ) in the form 
( J z + S z ) and bearing in mind the definition of the Lande factor (Eq. (3.7)), we 
obtain from Eqs. (2.96) (Wigner-Eckart theorem) and (2.108) 



(aJIFf\p 0 B(J z + S z )\aJIF'f > = 

= g 0 Bg aJ 5 ff , (-1 ) J+I ~WJ(J + 1)(2 J + 1)(2 F + 1)(2 F' + 1 ) 



x\ F ' 


F 




( F 


F' 




x< u 


J 


n 




f 





In particular, for F = F' we have 



(3.72) 



(aJIFf\g 0 B(J z + S z )\aJIFf) = p 0 B g aJ g his (F) f 5 ff , , 
where the hyperfine-structure Lande factor, g his (F), is given by 
1 F(F + 1) + J(J + 1) - /(/ + 1) 



9hk( F ) = 



F(F+ 1) 



(F ± 0) . 



(3.73) 



Thus for weak magnetic fields any lryperhne level of total angular momentum F 
splits up into (2 F + 1) magnetic sublevels, each characterized by its own /-value. 
The splitting is proportional to the magnetic held and to the product of the usual 
Lande factor g aJ times a Lande factor g h{s (F) which depends on the particular 
lryperhne level. This is the Zeeman effect for hyperfine structure. 



The Hamiltonian describing the direct interaction between the magnetic field and the nuclear 
spin can be safely neglected, since the corresponding energy is about 10 3 times smaller than the 
energy involved in the interaction between the magnetic field and the electronic cloud. 
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If the magnetic field is so strong that the Zeeman splitting is of the same order as 
the energy difference between different hyperfine levels, perturbation theory cannot 
be applied any longer, and one has to go back to the simultaneous diagonalization 
of the hyperfine-structure and magnetic Hamiltonians, whose matrix elements are 
given in Eqs. (3.70) and (3.72). The situation here is quite similar to that of the 
Paschen-Back effect (Sect. 3.4): since the matrix elements are diagonal with respect 
to /, the overall Hamiltonian can be factored into [2(1 + J) + 1] submatrices, each 
characterized by a particular /-value. 1 Thus the eigenvalues and eigenvectors are 
of the form 



A i(aJI,f) 

| aJIif} = C l F (aJI , /) | aJIFf) (3.74) 

F 

(where the C l F coefficients can be chosen real) and can be found, in general, 
by numerical diagonalization of the total Hamiltonian. Figure 3.11 illustrates 
the behavior of the energy eigenvalues for the hyperfine-structure components of 
the level 3 2 P 3 / 2 of Nal as a function of the magnetic field (the values of the 
hyperfine-structure constants are from Figger and Walther, 1974). As in the case 
of the Paschen-Back effect, there are several level-crossings between the hyperfine- 
structure magnetic sublevels. Such level-crossings induce important phenomena in 
resonance scattering (see Sect. 10.22). 

The strengths and the splittings of the various hyperfine-structure components 
can be computed in strict analogy with the case of the Paschen-Back effect. For 
the normalized strengths of the hyperfine components of the transition connecting 
a lower level (aJ) with an upper level (a 1 J 1 ) one obtains 



S^f = £ 



FF'F" F" 



2 / + 1 



C^iaJI, /) (aJI, f) & F , (a'J'I, /') & F ,„ (a'J'I, /') 



x \J (2F + 1)(2 F' + 1)(2 F" + 1)(2F"' + 1) 



J J' 1 
F' F I 

F' F 1 
-/' / ~q 



J J' 1 
pm p" J 

pm p" l 

-/' f -q 



with 



(9=-l,0,+l) 



J = 0 J' = 0 



(3.75) 



E = 1 

H'ff 

The relevant selection rules are 

AJ = 0, ±1 
Af = 0, ±1 . 

It is assumed here that the magnetic splitting is much smaller than the energy differences 
among the various J-levels. The opposite case will not be considered in this book. 
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Fig. 3. 11. Energies of the hyperfine sublevels of the level 3 2 P 3 / 2 of Nal as functions of the magnetic 
field strength. 

Similarly to the Paschen-Back effect, the strengths defined in Eq. (3.75) have 
some important properties, which can be proved by observing that the coefficients 
Cp(aJI, /) satisfy the following relations, similar to Eqs. (3.62) 

]T & F {aJI, f) C"(aJ/, /) = S u , (3.76a) 

F 

Y & F (aJI, /) Cj., (aJI, f) = S FF , (3.76b) 

i 

Y C F (aJI, /) Cj., ( aJI , /) \(aJI, f) = 

i 

= <aJIFf\H™ + H™ + H B \a JIF'f) . (3.76c) 

The properties of the strengths are the following: 

a) In the absence of magnetic fields, F is a good quantum number, and the eigen- 
vectors | aJIify converge towards the states | aJIFf) that are degenerate with 
respect to /; the coefficients C F are given by 



C F (aJI, f) — S iF , 
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so that we obtain 



gFf,F'f 



-JL-(2F+l)(2F' + l) 



f J 


J' 1 1 2 / 


' F' 


F 




F I J \ 


-r 


f 




and summing over the degeneracy parameters / and ft one gets the usual expression 
for the relative strengths of hyperfine-structure multiplets 



r F,F' _ v- c Ff,F'f _ ( 2 F + 1)(2 F' + 1) ( J J' 1 'l 

^ * (2/+1) \F' F 7/ 



b) The centers of gravity in frequency of the n, a T , and cr b components ( q = 0, +1, 
and —1, respectively) are linear functions of the magnetic field 

Av q = £ = -qgv h , (3.77) 

ii'ff 

where is the Larmor frequency, Av'fy, is the frequency shift of the component 
corresponding to the transition between the lower state \aJIif ) and the upper 
state | a' J'li' f'y from the unperturbed frequency of the transition between the 
two levels ( aJI ) and (a’J’I), 



Mjf> = 






(3.78) 



and where g is given in terms of the Lande factors of the lower and upper level 



9 — 2 (9a J + 9 a ' J ' ) + 4 (9 a J ~ 9a' J') U ( J + 1 ) - J' (^ + !)] • 



Comparison with Eq. (3.44) shows that the factor g is just the effective Lande 
factor that one would obtain for the transition between the levels (aJ), {a' J') if 
hyperfine structure were not present. 

The proof of Eq. (3.77) is rather tedious and quite similar to the proof of the 
analogous equation for the Paschen-Back effect. For this reason it will not be 
given here and is left as an exercise to the reader. 1 Equation (3.77) is particularly 
important, and it can be expressed by the following statement: The frequency shifts 
of the centers of gravity of the a r , a b , and it components of a hyperfine- structured 
line are the same as those resulting from the same line without hyperfine structure. 
This result is what should be expected from the principle of spectroscopic stability. 

To prove that the hyperfine-structure Hamiltonian (Eqs. (3.70)) gives no contribution to 
the quantity A u q . one needs the sum rules 

k 4 = fg n ( n + l)(2n + l)(3n 2 + 3n — 1), X/fc-i = J 2 ” 2 ( n + l) 2 (2n 2 + 2 n — 1). 

To evaluate the contribution of the magnetic Hamiltonian (Eq. (3.72)), the sum rule (2.41) must 
be used (instead of (2.48), employed for the Paschen-Back calculation). 
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Fig. 3. 12. Logarithm of the strengths of n (upper) and <j (lower) components against wavelength 
for the line EuII A 4205. The value of the magnetic field is 6 kG. Only the components having 
strength larger than 10 -8 are shown. The number of components of each type is approximately 
400. 

The numerical calculation of the strengths and splittings of the various hyperfine- 
structure components of a spectral line requires the diagonalization of the total 
Hamiltonian for the lower and for the upper level followed by the evaluation of 
expressions (3.75). If the element has more than one isotope, the calculations must 
be repeated for each of them and the splittings must be adjusted to allow for isotope 
shift. 

Landi Degl’Innocenti (1978a) has presented a computer program to perform such 
calculations. The program has then been applied to the line EuII A 4205 using the 
data for the hyperfine-structure constants given by Krebs and Winckler (1960). 
The result is shown in Fig. 3.12 for a magnetic field of 6 kG. The number of 
components is indeed striking, with an intricate pattern recalling a ‘wild forest’. 
The implications of such a pattern on the determination of the europium abundance 
in magnetic stars have been discussed by Landi Degl’Innocenti (1975). 



3.6. Atomic Level Polarization and Density Matrix 

When an atomic system is excited through a physical process which, for any reason, 
is not spatially isotropic, the various magnetic sublevels of the system, degenerate 
or quasi-degenerate with respect to energy, are, in general, not evenly populated; 
moreover, they are characterized by definite phase relations, and the atomic system 
is said to be polarized. A typical example occurs when the atomic system is excited 
by a unidirectional (or polarized) radiation beam, or by collisions with a collimated 
beam of fast particles. 
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In such situations, the description of the atomic excitation in terms of the usual 
concept of populations of the atomic levels (or sublevels) is no longer adequate, 
and one must resort to the more sophisticated concept of atomic density operator. 

The density operator can be defined for any assigned quantum system. If p a 
(p a — 0) is the probability of finding the system in the pure state ) (normalized 
to 1), the density operator p is defined by 

p = E^ (a) ><^ (Q) i- ( 3 - 70 ) 

a. 

Introducing a complete and orthogonal basis of unit vectors { | )> } for the Hilbert 

space spanned by the vectors |X°X> one has 

<u m \p\u n } = p mn = EpX u ml0 (a) X0 (a) K> . 

Ot 

where p mn are the matrix elements of the density operator, also called density- 
matrix elements. 

Some important properties of the density operator follow directly from its defi- 
nition: 

a) The density operator is Hermitian, 

P = , 



which leads to the remarkable symmetry property of the density matrix 



Pmn 



= P 



* 

nm ' 



(3.80) 



b) The density operator is positive definite. In fact, for an arbitrary state vector 
| (j>y we have 

<01 p 10 ) = E p » I < 0 (a) 10 ) I 2 > 0 ; 

a. 

in particular, for \<py = |it„) 

Pun > 0 . (3-81) 



and hence the diagonal matrix elements are real and non-negative. 

c) The trace of the density matrix is independent of the basis chosen to define its 
matrix elements. In fact, since a complete, orthonormal basis satisfies the relation 



E KX U J = 1 , 



we obtain from Eq. (3.79) 



TrO) = E^n = EE |, o <wJ0 (a) ><0 (a) K> 

n n ex. 

= E Pc <0 (a °l [E KX'uJ |0 (a) > = E Pa • 



(3.82) 



(3.83) 
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In particular, if the probabilities p a are normalized to unity one gets 

Tr (p) = 1 . (3.84) 

d) The density-matrix elements satisfy the following Schwarz inequality 

\p nm \ 2 < PnnPmm • ( 3 - 85 ) 

To prove this relation, let us consider the state vector 

1^) = Pnm K > - Pun I M m> 5 
for the diagonal matrix element (<fi \ p \ (jP) one gets 

(\(f)\ p 0 /■ Pnn (Pnri Pmm l/^nml ^ > 

which proves Eq. (3.85) since <((/>| p \4>y > 0 and p nn > 0. 

e) If the quantum system is in a pure state rather than a statistical mixture of 
states, or, in other words, if the sum in Eq. (3.79) is restricted to only one state 
\ip <0> ) . the above inequality reduces to an equality 

I Pnm 1 2 = l<« J^l 2 I<UJV’ (0> >| 2 = P nnPmm • 

f) The expectation value of any dynamical variable is equal to the trace of the 
product of the density operator times the operator associated to the variable 

{A) = Y J P a <^ a) \M ^ a) > = Tr { P A) . (3.86) 

Ot 

Indeed, using Eq. (3.82) one obtains 

Tr (pA) = i U n\P A \ U n'> 

n 

= <u n \^2 Pa \^X^ a) \A\u n ) 

n a 

= ^2'^2p a <'>p {0 ‘ ) \ A \u n y<.u n \^ ( ' a) y 

n ot 

= £^ (a) |tiiv’ ( q) > = <^>- 

a 

g) If a system consists of two subsystems a and b, the trace of the density operator 
of the system over the states of one subsystem is the density operator of the other 
one. Indeed, let {| a n )} and {| fr m >} be two complete bases for the Hilbert spaces 
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of subsystems a and b , respectively, and p the density operator of the compound 
system. If O ia ' ) is an operator acting on subsystem a only, we have 

<0 (a) > = Tr (pOM) =£ <aj < b m \ pO <“> | b m > | a„> 

nm 

= E < a j{E <6 ro |p|6 m >}o (o) l«n> 

n m 

= E Tr (b) (p) 0 (a) K>- 

n 

On the other hand, if is the density operator of subsystem a, we also have 
(0 (a) ) = Tr (a) (p (a) 0 (a) ) = E <«„ I P (a) 0 (a) | a„> . 

n 

Comparison of these two relations shows that 

Tr (b) (p) = p (a) . (3.87) 



h) The time evolution of the density operator in the Schrodinger representation is 
described by the equation 

( 3 . 88 ) 

where the symbol [A : B] denotes the commutator of the two operators A and B , 
and where H is the Hamiltonian of the system. 

This equation follows directly from the Schrodinger equation applied to the state 
vectors | tp^') entering the definition of p. In fact, we have from Eq. (3.79) 



d_ 

cl t 



P = EPo 



d t' 



l^ (Q) > )<^ (a) l + l V’ (a) >(^<V’ c 



= E^aT? H\^ a) X^ a) \- E p “ T \^ a) x^ a) \ H . 



which proves Eq. (3.88). 

As far as the density-matrix elements are concerned, their time evolution depends 
on the particular basis { | u n ) } . If we choose the basis of the energy eigenvectors, 
or, in other words, if the vectors \u n ) are such that 

#K> = E n\ U n> - 



we have from Eq. (3.88) 



cl 

df Pmn 



cl 
d t 



<' U m\P\ U n> 



27T 

— OJ [H,p] I U n> = 27 ri l^mn Pn 
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where the Bohr frequency v mn is defined by 



E m~ E n 



On the other hand, for a different basis one gets 



d 

d t Pmn 



27ri 

~h 



n 



tt n rr 

11 mp rpn 11 pn rrt 






where 

H ij = <XI#K> • 

For an atomic system, the most natural basis for defining the matrix elements 
of the density operator is the basis of the eigenvectors of the angular momentum, 
although in some cases this is not the basis of the energy eigenvectors (see for 
instance the discussion of the Pasclren-Back effect in Sect. 3.4). On this basis, the 
elements of the density matrix are 



p (aJM, a'J'M') = (aJM\ p\a'J'M'} . 



In this representation - usually referred to as the standard representation of the 
atomic density operator - the diagonal terms, p(aJM,aJM), represent the pop- 
ulations of the various magnetic sublevels, with the overall population nj of an 
assigned J-level given by 



rij = p (aJM, aJM) . (3.89) 

M 

Conversely, the off-diagonal terms p (aJM,a' J' M') represent the so-called coher- 
ences or phase-interferences between different magnetic sublevels. Their physical 
meaning is illustrated, in a qualitative way, by the following example. 

Suppose that a simple atomic system, namely a two-level atom with a lower level 
having J = 0 and an upper level having J = 1, is excited by a stationary radiation 
field tuned at the frequency of the atomic transition. If we choose a quantization 
direction for the atomic system, and take a reference system (a :yz) with the z-axis 
in the quantization direction, the eigenvectors \a.JM') are of the form |1 — 1), 
|1 0), |1 1) for the upper level, and |0 0) for the lower level. 

It will be proved later (see Sect. 7.1) that if the atom absorbs a photon propagat- 
ing along any direction in the x-y plane and linearly polarized in the z-direction, 
it is excited from the level |0 0) to the level |1 0). Similarly, if the atom absorbs a 
circularly polarized photon propagating along the z-direction, it is excited to the 
level |1 1), or |1 —1), according to the handedness of circular polarization. In all 
these cases, the diagonal matrix elements <(1M| p |1M) for the upper level attain 
a definite value, but the coherences remain zero. 
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For any other incidence direction, or any other polarization of the absorbed 
photon, the atom is excited to an upper level which is a linear combination of the 
form 

c_r |1 -1> + Co |1 0> + Cl |1 1> = £ c M |1M> , 

M 

where the three coefficients c M depend on the direction and polarization of the 
absorbed photon. In this case the coherences are non-zero and take the simple 
form 

(1M\ P \1M')=c m c* m , . 

The physical meaning of coherences is just contained in this simple example. We 
will see in Sect. 5.13 how a qualitative analogy can be drawn between atomic 
coherences and interferences in classical oscillators. 

Although coherences can be defined for any pair of magnetic sublevels, no matter 
how large is the energy difference between them, the most important are those 
between degenerate or quasi-degenerate sublevels. For such coherences one can 
introduce more compact symbols to simplify the notations; for instance, for the 
magnetic sublevels pertaining to an assigned term a one can write 

(aJM | p | aJ'M'y = p a {JM , J’M') , 

and if the term is described by the L-S coupling scheme 

(0LSJM | p | fiLSJ'M'y = p 0LS (JM, J’M') . (3.90) 

For the coherences between magnetic sublevels of the same J-level the following 
notation can be used 



<a JM | p | aJM'} = p aJ (M, M') , (3.91a) 

and for the diagonal elements 1 

(aJM\ p | aJM) = p aJ (M , M) = p aJ (M) . (3.91b) 

The atomic density matrix can be defined in the presence of hyperfine structure as 
well. With obvious notations, for the matrix elements between different hyperfine- 
structure magnetic sublevels one can write in general 

<aJIFf\ p \o! J'lF'f'y = p ( aJIFf , a'J'IF'f) , 

and for the coherences between hyperfine-structure sublevels belonging to the same 
J-level 

<aJIFf\ p \aJIF'f > = p aJI (Ff, F'f) . (3.92) 

The more compact notation p aJ (M) can be conveniently used in the physical situations 
where all the off-diagonal elements vanish. 
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Important relations can be established between the different quantities here in- 
troduced, like for instance between those defined in Eqs. (3.91) and those defined 
in Eq. (3.92). We can consider an atom with lryperfine structure as consisting of 
two separate parts, characterized by the eigenstates a JM) and I Mj ) (where M r 
is the projection of the nuclear spin along the z-axis) and by the density operators 
p (J) and p (I \ respectively. Since the quantum number / of the atomic system is 
fixed, we have from Eq. (3.87) 

E <IM I \p\IM I } = p< j >, 

Mi 

where p is the density operator of the whole system. Taking the matrix element of 
these operators between the eigenstates | aJM') and | aJM") we get 

(aJM\p (J) | aJM’} = E (aJIMM I \p\aJIM’M l '> , 

M I 

which relates the density-matrix elements in Eqs. (3.91) to those in Eq. (3.92). In 
the following we will use this kind of relations by dropping the apex (Jj, and we 
will simply write 



P aJ (M, M') = E (aJIMM I \p\aJIM'M I ) . 

Changing the coupling scheme via Eq. (2.12) we have 

p aJ (M, M') = E (JIMM, | JIFf) {JIM'M, | JIF'f) 

FF'ff'M j 

x (a.JIF f \ p \aJIF'f'y , 

or, in terms of 3 -j symbols 

p aJ (M,M')= E (-1) / - / V(2F + 1)(2F' + 1) 

FF'ffMj 



( J 


I 


F \ 


( J 


I 


F' \ 


\M 


Mj 


~f) 


\M' 


Mj 


-r) 



x P aJI (Ff, F' f') . (3.93) 

A similar relation can be established between the density-matrix elements relative 
to different J-levels of a term (Eq. (3.90)) and the density-matrix elements that 
one can define for the same term by neglecting its fine structure. Setting first 



<0LM L \p \(3LM' l } = p 0L (M L , M' l ) , 
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where M L is the projection of the orbital angular momentum on the 2 -axis, and 
then introducing the inner structure due to the spin, one obtains 



P PL {M L ,M' L )= Y, (— 1) m - m V(2J + 1)(2J' + 1) 



( L 


s 


J \ 


( L 


S 


J ' \ 


\M l 


M s 


-M ) 


\M'l 


M s 


-M' ) 



X P/3ls(JM, J'M') . (3.94) 



We want to remark that the density-matrix elements on the basis of the eigen- 
vectors of the angular momentum depend on the reference system chosen to define 
such eigenvectors. If R is the rotation that brings a reference system (the ‘old’ 
one) into another reference system (the ‘new’ one), the eigenvectors of the angular 
momentum in the new system are connected with those in the old system by the 
relation 

\JM} new = D(R)\JM} old , 

so that the transformation law for the density-matrix elements is the following 



p(aJM,a'J'M')\ 

J new 

= new<«^l P Ic/J'A Onew = old<«^l ^ P D W J' M% u 
= Y V nm{RTK'm'{R) \p(aJN,a'J'N')] 



NN' 



J old 



(3.95) 



In particular, for the matrix elements diagonal with respect to a and J 
[p aJ (M,M')l = Y Km(RYK'M'(R) \p a j(N,N')} 



NN' 



J old 



(3.96) 



3.7. Multipole Moments of the Density Matrix 

As shown by Eq. (3.95), the transformation law for the density-matrix elements 
on the basis of the eigenvectors of the angular momentum involves the product of 
two rotation matrices. We can however construct - similarly to Sect. 2.7 - linear 
combinations of these matrix elements whose transformation law involves just one 
rotation matrix. By so doing we obtain the irreducible spherical components of the 
density matrix, which are often referred to as the multipole moments of the density 
matrix or the spherical statistical tensors. 
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Defining the multipole moments of the density matrix by the expression 1 
p«(aJ,a'J') = J2 (-l) J - M V2K + T 

MM' 

M —M' ^q)p^JM,c/J'M% (3-97) 

it is easy to prove that, under the rotation R of the reference system, the multipole 
moments change according to the law 

'p«(aJ,a'J') } =Y\pQ'(a J ,a'J') 1 V« Q {R)* . (3.98) 

L ^ J new — L ^ J old 

Q' 

The proof of Eq. (3.98) will not be given here, being quite similar to an analogous 
proof given in Sect. 2.7. We just want to remark that the factor \/2I\ + 1 in 
Eq. (3.97) is inessential: in fact, if this factor is replaced by an arbitrary function 
of K , Eq. (3.98) is still satisfied. The definition given in Eq. (3.97) is the same as 
Omont’s (1977), but different authors give different definitions for the multipole 
moments, and no definition has a special advantage. 

Comparison of Eqs. (3.98) and (2.78) shows that the transformation law for the 
multipole moments involves the complex conjugate of the rotation matrix, whereas 
the transformation law for irreducible spherical tensors involves the rotation matrix 
itself. Thus the multipole moments - as defined in Eq. (3.97) - are not, strictly 
speaking, irreducible spherical tensors. A different definition could be given by 
substituting p ( a' J' M ', aJM) for p ( aJM , a'J'M ') in Eq. (3.97), and the multipole 
moments so defined would indeed satisfy the transformation law for irreducible 
spherical tensors. However, this alternative definition has no advantages over ours. 

Equation (3.97) can be easily inverted using the orthogonality relations of the 
3 -j symbols. From Eq. (2.23b) one obtains 

p ( aJM , a'J'M') = ^ (-1)' J ~ M V2K + 1 

KQ 

x(/ f - M > (3-99) 

The conjugation property of the multipole moments follows from Eq. (3.97) and 
from the Hermitian character of the density operator 

p«(aJ,a'J'y = (-1 y- J ’-Qp« Q (a'J',aJ) . (3.100) 

Similarly to Sect. 3.6, one can introduce shorthand notations for the multipole 
moments when dealing with restricted subspaces. For instance, for the multipole 
moments relative to the magnetic sublevels of a term one can write 

p%(aJ,aJ') = a p%(J,J') , 

Note that for any atom (or ion) the quantum numbers J and J' are both integers or both 
half-integers, thus the rank K of the multipole moments is always an integer. 
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and for the magnetic sublevels of a given J-level 
p%(aJ,aJ) = pg(aJ) 

= E (-1) j ~ m V2K + T ( J m J m , K q '] Paj(M, M') , (3.101) 

with the conjugation property 

p«(ajy = (-lfp« Q (aJ) . (3.102) 

The analytical expressions for the most common multipole moments, as functions 
of the ordinary density-matrix elements, are given in Tables 3.6 and 3.7. 

The multipole moments can be also defined for a J-level having hyperfine struc- 
ture. With self-evident notations, we have 

aJI p%(F,F') = Y, (-l) F ~ f V2K+l 
ff 

X F ' f , — Q^j PaJl(Ff,F'f') . (3.103) 

Relations similar to those in Eqs. (3.93) and (3.94) can be easily established for 
the multipole moments. From Eq. (3.93), after some Racah algebra involving the 
contraction of 3 -j symbols (Eq. (2.42)), one obtains 

Pq(<xJ) = E (-l) J+/+F ' + V(2f + 1)(2F' + 1) 

FF' 

X { F J F J F } aJI pQ( F F') ■ (3.104) 

In a similar way, from Eq. (3.94) the following relation can be proved 

Pq(PL) = E (-l) i+S+J ' + V(2J + l)(2J' + l) 

JJ' 

X {j L *} 0LS PQ( J ’ J '^ (3-105) 

where Pq(/3L), the multipole moment for a spinless J-term, is defined by 
Pq(0L) = E (-1) L ~ Ml V 2K+1 ( L _ L m , K) PflL (M L ,M' L ). 

M r M' V L L V / 



An important property of the multipole moments follows from the evaluation 
of the expectation value of an arbitrary spherical tensor operator Tq . Denoting 
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TABLE 3.6 



Analytical expressions for the multipole moments Pq (a J) as functions of the matrix elements 
p aJ (M, M'). The index (aJ) is suppressed for conciseness. The multipole moments for negative 
values of Q can be obtained through the relation = (—1)^ Pq{olJ)* . 



J = 0 


P° 0 =P( 0 , 0 ) 




P 0 ~ V 2 [‘ 
p o = 72 [' 
p\ = -p(i 


p(§.§) + P(-|>- 5 )] 
p(§>§)-P(- 5 >- 5 )] 


1 


p 0 = 7 f [' 

p 0 = 75 [' 

p l = -75 
p o = 7 e [' 

^ = ~75 
p| = p(l, - 


p(l,l)+p(0,0)+p(-l,-l)] 

p(l,l)-p(-l,-l)] 

[p(l,0) + p(0,-l)] 
p(l,l)-2p(0,0)+p(-l,-l)] 
[p(l, 0) - p(0, — 1)] 

-1) 


J- 3 
J 2 


Po = 1 [p( 
p 0 = 
d = 

Po = 1 [p( 

^ = - 7 f 

p 2 = 7 f l 
O 3 ~ -i- 

p 0 2 s /5 

* = ~ 3 l 

p 2 = 75 [' 

p§ = -p(l 


|.|) + P(|>|)+P(-|>- 3 ) + P(-|>-|)] 
[ 3 p(|, |) + p(|, 5) - P(-|> “§) ^ 3 P(-|> -§)] 
= [\/ 3 p(|, i) + 2p(|,-i) + a/ 3 p(-|, -§)] 
§>§) — P( 3 ’§)— P(-§> — §) + P(- §>-§)] 

[p(§> |) -P(-|.-|)] 

p(!.- 5 )+p(|>-|)] 

[p(|. |) - 3 P(§’ 5) + 3 p(-5>“3) -p(- §>-§)] 

[p(§> 5) - v / 3 p(l,-i) + p(-|,- §)] 
p(§.- 5 ) -P(|.-|)] 

■-“§) 


J = 2 


^0 = 7S [' 

p 0 - v^o 
p l = “To 
p 0 = 


p(2, 2) + p(l, 1) + p(0, 0) + p(- 1, -1) + p(— 2, -2)] 

[2 p(2, 2) + p(l, 1) - p(-l, -1) - 2p(-2, -2)] 
j [v^p( 2 , 1 ) + \/ 3 p(l, 0 ) + V 3 p( 0 , - 1 ) + \Pl p( — 1 , - 2 )] 
[2 p(2, 2) - p(l, 1) - 2 p(0, 0) - p(- 1, -1) + 2 p(— 2, -2)] 



p l = -7u |VM2,1)+p(1,0)-p(0,-1)- VM-1.-2)] 

p2 2 = T7 W^p( 2 ’ 0) + V3p(l, -1) + V2p(0, -2)] 
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TABLE 3.6 



(continued) 



J = 2 


Po = Vm [p(2, 2 ) — 2 p(l, 1 ) + 2 p(— 1 , — 1 ) — p(— 2 , — 2 )] 




P? = -^5 [v / 3p(2, 1) - v^p(l.O) - V2p( 0 , -1) + V3p(-1, - 2 )] 




Pi = 75 [p( 2 , 0 ) - p( 0 , - 2 )] 




Pi = -73 [p( 2 ,-l)+p(l,- 2 )] 




P 4 o = [p(2, 2) — 4p(l, 1) + 6 p(0, 0) — 4p(— 1, — 1) + p(— 2, —2)] 




Pi = -^4 [^ 2 - !) - V6p(l,0) + i/6p(0, -1) - p(-l, -2)] 




p| = -i= [ v / 3p(2,0)-2 v / 2p(l,-l) + ^3p(0,-2)] 




pi = -75 [p(2> — 1 ) — P(l> —2)] 




II 

JO 

1 

to 



by (Tq ) its expectation value, and referring, for simplicity, to an atom devoid of 
hyperfine structure, we have 

(T«) = Tr(pT«)= <a'J'M'\pT«\a'J'M') . 

oc'J'M' 

Introducing between p and Tq the completeness relation 

1= \aJM)(aJM\ , 

aJM 



and applying the Wigner-Eckart theorem (Eq. (2.96)) and Eq. (3.99), one obtains 

( T q)= E m (uJ\\T K \WJ'}pK(.ala'jy , (3.106) 

qq ' J J' 

which shows that the expectation value of a spherical tensor operator is strictly 
connected with the multipole moments of the same rank. 

Let’s now consider the physical meaning of the multipole moments. For the 
0-rank moment, we have from Eq. (3.101) 

^) = ™ErfM), (3.107) 

so that the overall population n aJ of the (a J)-level is given by 



n aJ = V2J + 1 p° 0 (aJ) . 



(3.108) 
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TABLE 3.7 

Analytical expressions for the multipole moments a pQ (-/- J') as functions of the matrix elements 
p a (JAd, J' M'). The index a is suppressed for conciseness. The multipole moments a pQ (J 1 ■ J) 
can be obtained through the relation J) = (— 1 y~ J, +Q a p^g(J, J’)* . 

J = 0 p\ (0, 1) = p(0 0,1 — 1) 

J' = 1 pj(0, 1) = -p(0 0, 1 0) 

P-i(0, 1) = p(0 0,ll) 

pi(§.§) = § I + -§•§-§)] 

J, = I Po( I- |) = -75 + 

A(M) = ![V3p(H.§§ )+kH,!D] 
pKil)--p(H-l-l) 

P?d>§) = 5 [V3p(H>|-§)-P(5-5>§-|)] 

Pod’ I) = _ 75 [p(H>H)-p(5-5>§-5)] 

P?_ld>§) = 5 [p(H> I §)-V^P(| -§>§§)] 

c 2 f I 3-, _ /I _ 1 3 3-, 
r — 2\ 2 ’ 2 ' 2 2’’ 2 2' 

J = 1 Pl(l’ 2 ) = 7^5 [p(! L, 2 0) + V3p(l 0,2-1) + x/Ml -1,2-2)] 

J' = 2 pj(l,2) = -^= [v / 3p(H,21) + 2p(10,2 0) + V3p(l -1,2-1)] 

pLi(l,2) = ^ [v / 6p(H,22) + V3p(10,21) + p(l-l,20)] 
p|(l,2) = -^[p(ll,2-l) + v^p(10,2-2)] 
pf (1, 2) = [ v / 3p(ll,20) + p(10,2-l)- x/2p(l-l,2-2)] 

pg(l, 2) = - -±= [p(l 1,21) - p(l -1,2 - 1)] 

P?_i (1, 2) = -j. [V2p(l 1,22) -p(l 0,21) - x/3p(l -1,20)] 

P^zt 1 ’ 2 ) = 7f [\/2 p(l 0, 2 2) + p(l —1,21)] 

p|( 1, 2) = p(l 1, 2 —2) 

Pi (1,2) = [t/2 p(l l, 2 — 1) — p(l 0, 2 —2)] 

Pi (1, 2) = [\/6p(l 1, 2 0) -2 n/2p( 1 0,2-1) +p(l -1,2-2)] 

pg(l, 2) = - [p(l 1, 2 1) - V3p(l 0, 2 0) + p(l - 1, 2 - 1)] 
pii(l,2) = -^= [p(l 1,2 2) -2v / 2p(l 0,21) + x/6p(l- 1,2 0)] 

Pl 2 (l,2) = ^ [p(l 0, 2 2) — -v/2 p(l —1,2 1)] 

Pl 3 (l,2) = p(l -1,2 2) 
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To give a physical interpretation to the other multipole moments, let us consider 
the spherical tensor operators Jq that can be constructed with the rectangular 
components J x , J y , and ,J Z of the angular momentum according to Eqs. (2.82) and 
(2.84). Assuming for simplicity the density-matrix elements to be non-zero for just 
one (aJ)-level, from Eqs. (3.106) and (2.97) we obtain 



M> = <A> = + + el («J) 

(Jl) = (-^(A + i J,)} = V / J ( J + 1 H 2J + D A{aJ) . , 

which give after inversion 1 

pl(aJ) = V3- 



1 



Re 



Im 



p\(aJ) 

p\{aJ) 



\J J (</ + 1)(2J + 1) 

/3 1 



(J z ) 



2 V / J(J + 1)(2J+1) 
3 1 



< 4 > 



2 V / J(J + 1)(2J+1) 



a>- 



(3.109) 



These formulae show that the multipole moments of rank 1 are connected with 
the average value of the angular momentum components on the axes x, y, and z. 
When such values are non-zero, there is a preferred direction in space, identified 
by the vector ( J ), along which the atom may be thought to be oriented. For this 
reason the multipole moments of rank 1 are called the orientation components of 
the density matrix. 

For the tensors of rank 2 one obtains, again from Eq. (3.106) 2 



pl(aJ) = V5 /(J) <2 Jl - Jl - J 2 ) 
15 



Re 



Im 



Re 



Im 



p\{aJ) 

p\{aJ) 



f(J)(J z J x + J x J 2 ) 
/(J) {Jyj z + J Z Jy) 



pt(aJ) =\- 



p 2 (otJ) — \l 2 /(^) {^x^y Jy^x) 



(3.110a) 

(3.110b) 

(3.110c) 

(3-110d) 

(3.110e) 



Note that, for J = 0, the quantities pQ are identically zero as it follows directly from 
Eq. (3.101). Equations (3.109) are valid for J / 0 only. For the same reason Eqs. (3.110) are 
valid for J > 1 only. 

2 Note that the operators ( J.), (i, k = x, y, z) are Hermitian, so that their expectation 
values are real. 
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Fig. 3. 13. To get an intuitive grasp of the meaning of the orientation and alignment components of 
the density matrix, we can regard the atomic angular momentum as having a uniform probability 
of being found on the surface of the figures drawn above. For the sphere in panel (a), the atom 
is neither oriented nor aligned; for the ellipsoid in panel (b) the atom is aligned but not oriented; 
finally, for the ellipsoid in panel (c) the atom is both oriented and aligned. To have an example of 
an atom oriented but not aligned we can go back to panel (a) and regard the angular momentum 
as having more probability of being found on the upper hemisphere than on the lower one. 



f{J) = 



/ ./(J-rl)(2J - 1 )(2 J+ l)(‘2J + 3} ’ 



the reduced matrix element {aJ|| J 2 ||aJ) having been evaluated via Eqs. (2.102) 
and (2.36h). 

The formulae now deduced show that the multipole moments of rank 2 are con- 
nected with the average value of bilinear combinations of the angular momentum 
components J x , J y , and J z . These multipole moments are called the alignment 
components of the density matrix. The word ‘alignment’, as opposed to ‘orienta- 
tion’, is used in this context to mean that the atom may be regarded as aligned 
with a particular straight line in space, irrespective, however, of which direction is 
chosen on the line. In Fig. 3.13 we give an intuitive illustration of the meaning of 
the multipole moments of rank 1 and 2. 

Turning to multipole moments of higher rank, it is possible to show that those 
of rank n are connected with the average values of polynomials of degree n in 
the angular momentum components J x , J y , and J z . No particular name has been 
assigned to these moments. 
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Another general property of multipole moments, which follows directly from the 
definition (3.97), should be remarked: a moment of the form pg ( Q ^ 0) is a 
linear combination of coherences between states characterized by projections of the 
angular momentum differing by Q; a moment of the form p ^ is a linear combination 
of populations. 

Finally, some inequalities can be established for the multipole moments as a 
consequence of Eq. (3.85). Taking the square modulus of Eq. (3.101) and summing 
over K and Q we obtain 

E ldqM)| 2 = E 

KQ MM' 



on the other hand, from Eqs. (3.107) and (3.85) we have 

si 2 1 



Po( aJ ) 



2J+1 



E P aJ (M, M) p aJ (M', M') 



MM' 



> 



1 



2J+ 1 



E |p aJ (M,M')| 2 , 



MM' 



and from these two relations we get 



EE I Pq(vJ)\ 2 <2 J [p° 0 (aJ) 

KrjL 0 Q 



Similarly, for the multipole moments connecting two levels (aJ), (ct r J 1 ) one obtains, 
starting from Eq. (3.97) 



E \p%(aJ,a'J')\ 2 < V(2 J + 1)(2 J' + 1) p° 0 (aJ) p° 0 (a'J') . 

KQ 

Another important inequality can be established for the quantities Pq (aJ). From 
Eq. (3.101) we have 

^(oJ) = ^E(- 1 ) J_M (i _ J M *)p a j(M,M). 

M ' ' 



On the other hand, all the quantities p aJ (M,M ) are non-negative (see Eq. (3.81)); 
since their sum equals [2 J + l] 1 / 2 p°(aJ), we can write 



\Po(aJ)\ < v / (2^+1)(2J + 1) 



( J 


J 


K \ 


\M 


—M 


o ) 



p° 0 (aJ) , 



where M is the M-value giving the largest absolute value for the 3 -j symbol. 
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QUANTIZATION OF THE ELECTROMAGNETIC FIELD 
(NON-RELATIVISTIC THEORY) 



Many of the physical phenomena involved in the generation and the transfer of po- 
larized radiation in spectral lines can indeed be described by means of the classical 
theories of the electron and of the radiation field. In Chap. 5 we will apply these 
theories to obtain a simple derivation of the radiative transfer equations for polar- 
ized radiation and of the law of resonance scattering in the presence of a magnetic 
field. 

However, only the most simplified cases are accurately described by the classical 
theory. As far as the atomic system is concerned, the classical theory of the elec- 
tron gives an appropriate representation only of a two-level atom with a normal 
Zeeman triplet and an unpolarized ground level, so that its applicability is very lim- 
ited. Similarly, the classical description of the radiation field prevents the correct 
treatment of important phenomena such as, for instance, stimulated emission. 

In the main body of this book the interaction between atomic systems and the 
electromagnetic field will be described through a full quantum-mechanical formal- 
ism: this will allow us to handle even the most complicated physical situations 
without being obliged to introduce additional terms into the relevant equations by 
using phenomenological or heuristic arguments. To this aim, we need however the 
formalism of second quantization for the electromagnetic field, a formalism that we 
are going to develop in this chapter. 

We also want to remark that the formalism of second quantization for the radia- 
tion field is a classical subsection of quantum field theory and has been fully treated 
in excellent monographs (see for instance Dirac, 1935; Heitler, 1954; Aklriezer and 
Berestetskii, 1965). The reason why these concepts are again developed in this 
book is to establish on a firm basis a set of notations that will be employed in 
the following. On the other hand, in view of the physical applications that will be 
presented later, the formalism of second quantization needs not to be developed 
in its general, relativistically invariant form. We will thus restrict attention to 
the simpler non-relativistic formulation, while we refer the reader to the textbooks 
quoted above for an introduction to invariant formalisms. 



4.1. Quantization of the Harmonic Oscillator 

The quantization of the harmonic oscillator is at the basis of the formalism of 
second quantization, because, as it will be shown in the following, the Hamiltonian 
of the electromagnetic field reduces to the sum of an infinite number of independent 
Hamiltonians each formally equal to that of the harmonic oscillator. 
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The classical Hamiltonian of a one-dimensional harmonic oscillator, expressed as 
a function of the canonical variables q and p, is given by 

rr/ \ P 2 rrujj 2 o 

Hiq - P) =2^ + —' » ’ {iA) 

where m and to are the mass and the angular frequency of the oscillating particle, 
respectively. The Hamilton equations give for the particle motion 

dq = i dH(q,p) = p_ 
d t dp m 
dp dH(q,p) 2 

ck = d^ = ~ mUJ q - 



To solve these coupled equations we introduce, following Dirac (1935), two linear 
combinations of the variables q and p of the form 

a = A (p — i muq) , a* = A (p + i muiq) , (4-2) 

where A is a real constant which, for the moment, does not need to be specified. 
Taking the time derivative of these quantities we get the equations 



which have the obvious solution 



with o 0 to be determined from the initial conditions. Inversion of Eqs. (4.2) yields 
the solution to the classical problem 



q = — t (a — a*) 

2 mwA y 



P= 2A {a + a *) 



with a and a* given by Eqs. (4.3). 

The corresponding quantum-mechanical problem can be treated along the lines 
of the Correspondence Principle. The Hamiltonian is still given by Eq. (4.1), but 
q and p are now Hermitian operators obeying the commutation rule 

[q,p\ = • 



To solve the quantum-mechanical problem we introduce the operator a defined as 
in Eqs. (4.2) and its Hermitian conjugate a ' defined as a* in Eqs. (4.2). Expressing 
the Hamiltonian in terms of a and a) we obtain 
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and evaluating the commutator of a and o' we get 

[a, a*] = 2A 2 mhuj . 



We now choose the constant A in such a way as to make this commutator equal to 
unity; expressing a a' in terms of a' a we get 

H = Tito 

with 

[a, a f ] = 1 . (4.4) 

To find the eigenvalues and the eigenvectors of H , first of all we observe that if 
|u) is an eigenvector of H corresponding to the eigenvalue v, 




H\vy = v |r>> , 



we have 



<„|at«|„> = <*| £ - i |„> = (£; - j) <•!«>. 



Since the left-hand side of this equality is the square modulus of the vector a |u>, 
it must be non-negative, so that we obtain 



v > — Huj , (4-5) 

which means that all the eigenvalues are larger than hui/2. 

Next we observe that, owing to the commutation relations 

[ H , a] = —hu>a , [H, a^] = Tiloo) , (4-6) 

we have 

Ha |u> = ( aH — Tito a)|u) = (v — hu>) a |r;) , 

which means that the vector a |u) is an eigenvector of the Hamiltonian H corre- 
sponding to the eigenvalue ( v — Tuo) . 

Similarly, it can be easily shown that the vector a 2 \v') is an eigenvector of H 
corresponding to the eigenvalue (u — 2 Tuo), and, in general, that the vector a n |v) is 
an eigenvector corresponding to the eigenvalue (v—nhu>). The chain of eigenvectors 
a |u), a 2 |r;), . . . , a n |r;), . . corresponding to smaller and smaller eigenvalues, must 
however be limited, because otherwise property (4.5) would be violated. This 
means that it must exist an eigenvector, which we denote by the symbol |0), such 
that 

a |0> = 0 . (4.7) 

For this eigenvector we have 




= 2 Sw l°> 



H |0> = hto 
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Conversely, from the eigenvector |0) one can construct different eigenvectors 
corresponding to larger and larger eigenvalues. In fact, from Eq. (4.6) we obtain 

Ha) |0> = ( a)H + Tiuj a^) |0) = ^ Uu a^|0> , 

and, in general 

Ha) n |0> = [n+^j Swa tu |0> . 

The eigenvector a^ n |0), however, needs to be normalized. Assuming the eigenvector 
|0) to be normalized, 

< 0 | 0 > = 1 , 

it is easy to prove via Eqs. (4.4) and (4.7) that 

<0|a n a tn |0> =n\ . 

The normalized eigenvectors of the Hamiltonian 77, which we denote by the symbol 
|n), are therefore defined by 



l n > 




and are such that 



H\n} 




a |?r) = \fn \ n — 1) 



a^|?r) = Vn + 1 | n + 1) . 



(4.8) 



To conclude, the energy spectrum of the Hamiltonian of the harmonic oscillator 
consists of a series of equispaced energy levels of the form (n + 1/2) froj. The integer 
n can be regarded as the number of a sort of particles, or discrete quanta, of energy 
hu> that can be created or destroyed and whose presence contributes to the total 
energy of the harmonic oscillator. Formulae (4.8) show that the operator a, when 
acting on an eigenvector, has the effect of decreasing the number of particles, while 
the operator a) has the opposite effect of increasing it. For this reason the operators 
a) and a are called the creation and the destruction (or annihilation ) operators, 
respectively. 



4.2. The Electromagnetic Field as a Superposition of Plane Waves 

Consider the electromagnetic held enclosed in a cubic cavity having its sides equal 
to L. In the absence of charges and currents, one can choose a special case of the 
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Lorentz gauge where the scalar potential t) is identically zero, so that the held 
is described only by the vector potential A(r, t) which obeys the wave equation 

V 2 A(f, t) - ^ JL A(r, t) = 0 . (4.9) 

The Lorentz gauge is now expressed by 1 

div A{r, t) = 0 . (4-10) 

The electric and magnetic fields are connected with A(r , t) by the relations 

E(r,t) = -- 

c ot 

B(r,t) = curl A (r, t) . (4-11) 

At any given time t the vector potential, as well as any other physical function of 
r, can be expanded in Fourier series. If we require that the vector potential obey 
the so-called periodicity conditions 

A(x , y, z, t) = A(x + m x L, y + m y L, z + m z L, t) 

with m x , m y , m z arbitrary integers, we obtain 



A(r,t) = ^C fc -(t)e ifc> , (4.12) 

k 

where the summation is extended to all the values of k satisfying the periodicity 
conditions 

^ / 2ir 2i 2;r\ 

k = I n x — ,n y — ,n z — j , (4.13) 

with n x , n y , n z arbitrary integers. Because of the real character of the function 
A(r, t), the complex vector C^(t) satisfies the conjugation property 

c % {ty = . 

We now require that the vector potential A obey the wave equation (4.9). This 
means 

£ 3 ^— «§<?!<*>. U14) 

where 

u^ = c\k\. (4.15) 



1 



Note that Eq. (4.10) is not relativistically invariant. 
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Equation (4.14) can be easily solved to give 

/?(-) 



C s (t) = <7i-> e _i “ st + Ci +) e™** 



where C~ ) and Ci +> are constants. Substituting into Eq. (4.12) we obtain 



Ar, t) = A 



: ( _) i (k-r-ugt) 






i(fc-r+wct) 



(4.16) 



with 



(7 (_) * = (7 (+) 
k —k 



The vector potential in Eq. (4.16) is decomposed into progressive and regressive 
waves. Taking into account that u> t = we write the exponential of the regres- 
sive wave term in the form 






and then we change the summation index of this term from k to -k to obtain 



The supplementary condition (4.10) implies, for any k, 

= 0 , 

k 

which is a transversality condition holding for any Fourier component of the vector 
potential. This relation can be satisfied in the following way. For any wavevector 
k we define two complex unit vectors eV, (A = 1,2) both perpendicular to k and 
perpendicular to each other, 



Ax ■ k = 0 . 



Ax ■ Ax' = 



(4.17) 



Next we write 



Ci- ] e~ iURt 



E C kX^k) Ax ’ 

A=l,2 



where cp, (t) are oscillatory functions satisfying the differential equation 



and we finally obtain 




- iw fc C fcAW ’ 



A(r, t) 



E 



C kX^k) Ax 



\k-r 

e 



C kxA) 




(4.18) 



(4.19) 
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from which the electric and magnetic fields can be obtained via Eqs. (4.11) 



E(r,t) = 

c ^ 

k\ 

B(r, t) = i^2k 



z _» i k-r /lNsk _»* — i k-r 

c aW e a e — c Sa(*) e k\ e 



k\ 



c ea(*) Hx e ' k " - c# e 



— ife-i 



(4.20) 



The summations are extended to all the wavenumbers satisfying Eq. (4.13) and, 
for each wavenumber, to the two possible polarization states. Each set of values 
(k, A) defines a mode of the radiation field inside the cavity. The number of modes 
for which the modulus of k is contained within k and k + d k and its direction is 
contained in the solid angle clfi is given by 

N(k) d k dfi = 2 k 2 dk dfi ( — ^ =^V k 2 d k clfi , (4.21) 

V 27 r / 47r d 



where V = T 3 is the cavity volume. 



4.3. Quantization of the Electromagnetic Field 



We can now introduce the formalism of second quantization by interpreting the 
coefficients cr.(t) and crJt)* in Eqs. (4.19) and (4.20) as two operators cr, and 
cl acting on a suitable Hilbert space. By so doing, the vector potential and the 
cfectric and magnetic fields become quantum operators acting on the same Hilbert 
space. 

The Hamiltonian of the electromagnetic field can be found with the help of the 
Correspondence Principle, starting from the classical expression 

H = — [ (E 2 + B 2 )dV (4.22) 

8tt Jv 



which gives the energy of the field contained in the volume V. Substituting for E 
and B their expressions given in Eqs. (4.20) (with the coefficients cr.(t) and crAt)* 
replaced by the operators cj A and cl , respectively) the integral in Eq. (4.22) 
reduces to the sum of integrals of the form 



e KUk').f dV 



which can be easily evaluated to give, owing to the periodicity conditions (4.13), 
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We thus obtain 



H = ~^y] 

Rtt ^ 



Sn 



kXX' 



(^kx ' ^-kX') $ x e^ A ) • (k x e_£ A ,) 



+ ct c f - 
fcA —kX' 



— c- c- 
fcA kX' 



LU 






77 <Ta ■ e £v) + ( fc x %) ■ ( k x e 7v) 



— C- C- 
fcA fcA' 



f ( 6 7 a ’ %A') + ( k X ^* A ) ’ ( k X %A') 



(4.23) 



Using Eqs. (4.15) and (4.17) we have that the first two terms in the right-hand side 
identically vanish, while the last two terms give 1 



H = 



V 

47 re 2 k 

kX 



E 



C- ct + ct C- 
fcA fcA fcA fcA 



(4.24) 



Finally, performing the substitutions 



1 /wgV 

27t Ti 






fcA ’ 



1 /u E c t 



fcA c V 2-kTi fcA ’ 



(4.25) 



we can express the Hamiltonian of the electromagnetic field as the sum of an infinite 
number of Hamiltonians (one for each mode) formally identical to the Hamiltonian 
of the harmonic oscillator, 



H = H 






a kx a lx + a lx a kx 



k X 



The commutation properties of the operators ar, and at can be derived from the 
Correspondence Principle. Since these operators are, apart from a factor, the quan- 
tum equivalent of the coefficients crAt) which obey the differential equation (4.18), 
we require that the same differential equation hold for the expectation value of the 
operator aj: x on an arbitrary state vector | (jx) of the Hilbert space, 

^<^l«jE A l0> = <<i>\ a kx\^y ■ 



The cross products in Eq. (4.23) can be easily evaluated using the relations 

( [ Sx ^)i=J2jk e ijk a j b k - Tfc e ijk £ lmk = S il S jm - 5 im S jl > 

where the antisymmetrical tensor is defined on p.7. 
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Since the state vector | </>) evolves according to the Sclrrodinger equation 



we obtain 



\<t>} = H 10 ) 



(<t>\ i a k\’ H } ir> = nuj k <^1^1^) > 



or 



E 9 <*l Kv a fc'A'4'A' + 4v°fc'A'] l^> = <^l a fcA^> 



fc'A' 



Being |<^>) an arbitrary state vector, it can be proved that this equation is satisfied 
if 



[ a ifcA’ a fc'A'] — 4v] ^ 



fcA’ fc'A' 

when the operators refer to different modes (k ^ k' or A ^ A'), and if 

4a’ “ tj = 1 • 

The Hamiltonian H can then be expressed in its final form 

1' 



H = V Tilo? I at a- + 

k \ fcA fcA 2 



k A 



(4.26) 



(4.27) 



(4.28) 



Equation (4.28), together with the commutation rules (4.26) and (4.27), proves 
that the Hamiltonian of the electromagnetic field is the sum of an infinite number 
of independent Hamiltonians, each formally identical to the Hamiltonian of the 
harmonic oscillator. 

If we now arrange the different modes of the electromagnetic field in some definite 
order, and recall the results obtained in Sect. 4.1 for the harmonic oscillator, we 
have that the eigenvectors of the Hamiltonian H can be described by a state vector 
of the form 






4 



I rt ! . 

L* U 2- 



wr 



(4r 



>!)"*■ -to,) , (4-29) 



where (n 1 , n 2 , . . . , n i , . . .) are the occupation numbers and 



ir 0 > = |0, 0, . . . , 0, . . . > 



is the so-called vacuum state. 

The corresponding energy eigenvalues are given by 



H K,n 2 , 



n i5 . . . > = 
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Note that the presence of the term 1/2 in this expression leads to a non-zero (and 
indeed infinite) energy eigenvalue even when all the occupation numbers are zero. 
This represents a zero-point energy that can be eliminated from the theory by 
simply shifting the scale on which energy is measured. By so doing we can write 

F|n 1 ,n 2 ,...,n i ,...> = K,« 2 > • • ■ • • • > • 

3 



The physical interpretation of the formalism introduced above is the following. 
The occupation number n i is the number of photons in the i-tli mode, thus the state 
vector in Eq. (4.29) represents the physical state where n 1 photons are present in 
mode 1, n 2 photons in mode 2, and so on. Since the occupation number n i relative 
to the i-tli mode can take all the values 0, 1,2,.. ., and since, moreover, this number 
is independent of the occupation numbers relative to all the other modes, it follows 
that photons obey the Bose-Einstein statistics. This allows a direct deduction of 
Planck’s law. In thermodynamic equilibrium at temperature T, the probability P n 
(normalized to unity) of having n photons in a given mode of angular frequency 
is given by 



P„ = 



. hui 
\T 



E 



, hui 
\T 



m — 0 



where fc B is the Boltzmann constant, so that the mean number of photons in the 
mode is 

X ^ 

UP n= • 

ra= 0 e \ 

Recalling Eq. (4.21), one obtains that the energy per unit volume with angular 
frequency between u and aj + Alo is given by 



, , / t _ Ar/I . cl k 1 hu> 3 1 

uM = huniN(k) y dfl = 



huj 

e k s T — 1 



and that the energy density with frequency between v and p + dp is 

Aio hv 3 1 

imp) = u(u}) —— = 07 T — ; , 

v x ’ dp hv 



- 1 



which is the classical Planck's law. 

The action of the operators a i and a\ on a given state vector can be deduced 
from Eqs. (4.8) 



a i 


< 

II 

e" 


\n 1 ,n 2 ,...,n i - 1, . . . > 




a] \n 1 ,n 2 , ■ ■ ■ 


A/ 

II 


+ 1 |n l5 n 2 , . . . , + 1, . . 


•> 


4 a i 




|n 1 ,n 2 ,...,n i ,...> . 
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These equations fully justify the denomination of destruction and creation oper- 
ators for a i and <:i [ , respectively. The last equation shows that a\a i is a positive 
definite Hermitian operator whose eigenvalues are the occupation numbers rq. For 
this reason it is named the occupation number operator relative to the z-tli mode. 



4.4. The Stokes Parameters in the Formalism of Second Quantization 



The formalism introduced in the previous sections can be slightly modified to make 
it more suitable for the physical applications that will be presented in Chap. 6. 
Instead of labelling the modes with the wavenumber k we characterize them by the 
frequency v and the direction fi. Thus we replace the operators a^ x and at by 

the operators a(u, Q. A) and a){v, fi, A), respectively. Moreover, instead of the unit 
vectors eV, we will use the unit vectors e A (fl) which still depend on the direction 
O but not any longer on the radiation frequency. With these notations, and taking 
into account Eqs. (4.25) and (4.15), we can write for the operator A(r) defined in 
its classical form in Eq. (4.19) 1 



A(f) 

where 



and 



E c vS? i a{v ' A) ^ ' + at(z/ ’ ^ e lAr }’ ( 4 - 3 °) 




e x (d) • n = 0 , 



e A (f2)-e A ,(f2)* = S AA , (A, A' = 1,2), 



(4.31) 



a(u, fl, A), a(v', O', A') = a*(zq &, A), a^(z/, fi', A') =0 



[a(u, d, A), a* (is', A')] = 5 VV , 8 M , 5 XX , . (4.32) 

Similarly, for the electric and magnetic field operators we have 

E(r ) = i E \j—\ J~ { a ( ^ A ) e A ($1) e' k ' 

is£l\ 

-a t (zz,f2,A)e A (f2)*e“ ifc ' f? } 



B{r ) = i E \J-y~ { a (^ A ) x e A (^)] 

v£l\ 

— a f (u, Cl, A) [n x e A ((1)*] e _ifc? } , (4.33) 

To avoid any possible confusion, we use in this section the symbol ~ to distinguish the 
operators from the corresponding classical quantities. 
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and for the Hamiltonian (with the vacuum energy removed) 

H = ^ his a^ 1 (is, 0, A) a(is, Cl, A) . (4.34) 

isQ X 



From these expressions it is possible to derive, according to the Correspondence 
Principle, the quantum-mechanical operators corresponding to a number of classical 
dynamical variables. In App. 3 we present such derivations for the momentum and 
for the angular momentum of the electromagnetic field. 

Let us now consider the classical definition of the intensity of a radiation beam, 

^ ^ clS 1 dt dt'dfl 

where d E is the energy crossing the surface dS (perpendicular to the beam) in the 
time interval dt, with frequency between is and v + dp and direction contained in 
the solid angle clfi around the direction Cl. The corresponding quantum operator 
can be derived by the following arguments. The number of modes having frequency 
contained in the interval (is, is + dis) and direction in the solid angle clfi is given, 
for each polarization state, by (see Eq. (4.21)) 

is 2 

N(is) dis clfi = V — dis clfi . (4.35) 

c d 



If n(is, Cl, A) is the number of photons belonging to a particular mode, the number 
of such photons crossing the surface dS in the time interval dt is 



n(is , Cl, A) 



cdf dS 

V 



and since each photon carries the energy his one obtains 

his 3 



I(v,Ci) = — ^2 n(v, Cl, A) . 



X—1,2 



(4.36) 



This expression immediately suggests the form of the corresponding quantum- 
mechanical operator 



_ hy 3 _ _ 

I(is,Cl) = — ^ at (is, Cl, X)a(is,Cl, A) , 

C A=l,2 

and the generalization to the polarization tensor defined in Eq. (1.36) is quite 
obvious 

^ —t hl/3 

4/3 0> = — at 0> a ) a ( $4 P) ■ (4.37) 

The relations connecting the Stokes parameters with the components of the po- 
larization tensor depend on the reference direction chosen to define the Stokes 
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parameters (see Sect. 1.6) and on the specific unit vectors e A (fi) chosen to expand 
the vector potential A(r ) (see Eq. (4.30)). Obviously, both choices are completely 
arbitrary, and the relation between the Stokes parameters and the polarization 
tensor is in general rather involved (see Eqs. (1.42)). It is however convenient, for 
future applications, to define the unit vectors e A (fi) in the following way 



e +i(^) 



e_i(I4) 



1 

71 

i 

71 



-e a (0) +ie b (0) 
Oi(^) + i 0,(f5)l , 



(4.38) 



where e* Q (fi) and e b (fl) are the reference direction unit vector and the associated 
unit vector, respectively (see Sect. 1.6). It can be easily shown that these vectors 
satisfy Eqs. (4.17), namely 



e ±1 (d)-d = 0 

e+i(0) • e +1 (Q)* = e.^d) • e.^d)* = 1 
e + i(f2) • e_i(d)* = el^fi) • e +1 (d)* = 0 . 



Moreover, they satisfy the relation 

e ±i(^) = — e =pi(^) • 

With this choice for the unit vectors e A (f 1) - a choice that will appear clearer 
in the following, see Sect. 5.10 - the relation between the Stokes parameters and 
the polarization tensor can be derived from Eqs. (1-42) substituting 9 = 57t/ 4, 
c j> = — 7r/2 (which makes Eqs. (1-41) coincident with Eqs. (4.38)). We have 

/(zy d) = J ++ (zy O) + I (zy d) 

Q (zy d) = — (zy O) — / h (zy O) 

U (zy d) = — i /_ | (zy d) + i / h (zy O) 

V[y,d) = / ++ (zyd) , (4.39) 

where we have shortened the notations writing I ++ instead of I +[ +1 , and so on. 
Finally, substituting Eq. (4.37) we obtain 

A hi/3 —t —t —t 

/(zy Q) = — — [a^(zy O, +1) a(zy fi, +1) + a^(zy fl, —1) a(zy fl, —1) ] 

Q (zy d) = — — [ a H u > +1) a(zy 7 —1) + at{v, Q, —1) a(zy +1) ] 

^ h is 3 —> —> —> —* 

U(v, d) = — i — — \a\v, d, +1) a(zy Q, —1) — a^(zy d, —1) a(zy d, +1) ] 

V(v, Cl) = — — [a*(zy O, +1) a(zy $2, +1) — a^(zy d, — 1) a(iy d, — 1) ] , 
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which give the Stokes parameters of the radiation held in terms of creation and de- 
struction operators. The connection between the operator V (is, f l) and the angular 
momentum operator of the electromagnetic held is discussed in App. 3. 



4.5. The Density Operator of the Radiation Field 

The density operator p (R) of the radiation held can be introduced using the same 
general formalism presented at the beginning of Sect. 3.6. The definition of p (R> is 
indeed the same as that of Eq. (3.79), 

p (r) = ^ 2 Pa iv’r ) x^r ) i , 



where IV’ifb is now a state vector of the Hilbert space considered in Sect. 4.3. 

A complete basis of this Hilbert space is that of the energy eigenvectors given in 
Eq. (4.29), 

{ I n 1 ,n 2 ,...,n i ,...> } , 

so that the matrix elements of the operator p (R) have the form 



pTn^n 2 „. K,n' = < n l> n 2’ • • ■ - • I P (R) W>«2> 



The expectation value of any dynamical variable can be expressed in terms of 
the density operator p (R) . For instance, the expectation value of the electric held 
in a given point is expressed by 



E(r) = Tr (E(r) p <R) ) , 

and analogous formulae can be written for the other physical variables associated 
with the electromagnetic radiation held. 

In some special cases, the density operator of the radiation held can be written 
as the direct product of an infinite number of operators each spanning one mode of 
the radiation held characterized by the frequency v and the direction Q. 1 In these 
cases we have 



P (R) = P (R Vi,^i) ® p (R) (is 2 ,n 2 ) 






The expectation values of the polarization tensor components relative to the mode 
(u 0 , f2 0 ) are then given by 

UAA) = T r (-4/3 (^o> )d (R) ) = Tr {l a p(v 0 , II 0 ) p (R) (is Q , $1 0 )) > 



The concept of mode is used here in a broader sense than in previous sections. Modes 
are defined here irrespective of the photon polarization, so that, more precisely, they are double- 
modes. 
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or, with shorthand notations, 

4/3 = Tr (4/3 Po R> ) = Tr (4 a /3 Po R> ) . 

where Pq R> is the ‘reduced’ density operator acting on the mode (u 0 , f2 0 ) and where 
aT, and a ^ are the operators cu(u 0 , fi 0 , a) and a(i/ 0 , fl 0 , / 3 ), respectively. 

It should be remarked that the density operator described above is an operator 
acting on the Hilbert space of the occupation numbers. Sometimes, a different 
density operator is introduced, acting on the wavefunction space of the photons 
(Fano, 1949; Blum, 1981). This latter operator will not be used in this book. 
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INTERACTION OF MATERIAL SYSTEMS WITH POLARIZED 
RADIATION (THE CLASSICAL APPROACH) 



In this chapter we will discuss the physical mechanisms involved in the generation 
and transfer of polarized radiation in spectral lines within the framework of classical 
physics. In particular, the classical theory of the electron (already introduced in 
Sect. 3.2) will be used to describe the atomic system interacting with polarized 
radiation, and the electromagnetic radiation field will be described in terms of 
classical electrodynamics. 

The results that will be derived are of limited validity as, in most cases, the 
correct results can only be obtained by a quantum-mechanical treatment. The 
formalism of classical physics is, however, simpler and more transparent than the 
quantum-mechanical one. Thus it is suited to give the reader a plain introduction 
and an intuitive approach to the physical processes that will be treated in full 
generality in the following chapters. Moreover, some of the results obtained from 
classical physics coincide with special cases of the corresponding quantum results. 
And it is indeed illuminating and encouraging to find, as a limiting case of an in- 
volved quantum-mechanical calculation, the same result deduced from the simpler, 
classical formalism. 

This chapter is also intended to show the strict connection between some clas- 
sical and quantum concepts. This subject - generally overlooked in textbooks on 
Quantum Mechanics - can be conveniently illustrated by the physical processes 
described here, such as resonance scattering and the Hanle effect. Although the 
connection can be established only in a qualitative way, it is nevertheless important 
for the correct understanding of polarization phenomena. 



5.1. Propagation of Electromagnetic Waves in Anisotropic Media 

The propagation properties of electromagnetic waves in a material medium can be 
derived, in classical physics, from the Maxwell equations. In the following we will 
restrict attention to a homogeneous medium having magnetic permeability /i = 1, 
so that it is not necessary to distinguish between the two vectors B (magnetic 
induction) and H (magnetic field). 

In such a medium, with no free charges, the Maxwell equations take the form (in 
the Gauss-Hertz system of units) 

divD = 0 



divR = 0 



(5.1a) 

(5.1b) 
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curlB = ig 
c dt 


(5.1c) 


ld ^ 

curl E = — , 

c at 


(5-ld) 



where c is the velocity of light and where the electric displacement D is connected 
with E by the equation 

D = E + 47tP , 

P being the electric polarization. From Eqs. (5.1c) and (5. Id) it easily follows that 

_ _ 1 Q 2 n 

S7 2 E - grad di vE=- — . (5.2) 

c z ot z 

For an isotropic medium, D is related to E by the simple equation 

D = eE , 



where e is the dielectric constant of the medium. Substitution into Eqs. (5.2) and 
(5.1a) leads to the wave equation for the electric field 




d 2 E 

~W 



and to the transversality condition 



divE = 0 . 



A particular solution to these equations is the well-known plane wave of frequency v 
propagating along the direction f \ 



E = E 0 e 



( nr ■ Q—ct) 



(5.3) 



where n = yft is the index of refraction of the medium and E 0 is a constant such 
that 

e 0 ■ n = 0 . 

Note that Eq. (5.3) gives for the spatial evolution of the electric field components 



d£ 1 v 
— = 27ri - n £ • , 
as c J 



where £ • is the component of E along any direction perpendicular to f l and s = r-fl 
is the coordinate measured along the ray path. 
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For an anisotropic medium, like for instance a birefringent crystal, or a vapor 
permeated by a magnetic and/or electric field, the relation between D and E 
becomes of tensorial form, 

v i = £ E i > 

3 

where V i (i = 1,2,3) and £■ (j = 1,2,3) are the Cartesian components of the 
vectors D and E, respectively, and where e i:j is the dielectric tensor. 

From general theorems on matrices it follows that the dielectric tensor can always 
be diagonalized through a suitable similarity transformation. This means that any 
medium is characterized by a triplet of axes u a ( a = 1,2,3), called the principal 
dielectric axes, such that when the vectors D and E are expanded on this basis, 

£ = ( 5 - 4 ) 



one simply has 

V a = t a £ a - ( 5 - 5 ) 

The quantities e a are called the principal dielectric constants. In the special case 
where the medium is a crystal, the dielectric tensor is real and symmetric (Born 
and Wolf, 1964). In this case the principal dielectric constants are real, and the 
three unit vectors u a are also real and form an orthogonal triplet. 

By contrast, these properties break down for absorbing anisotropic media. In this 
case the principal dielectric constants are generally complex and the same holds for 
the unit vectors u a , which no longer are orthogonal. It follows that if we expand 
any vector v on the basis u a , 



'=£ 



(5.6) 



the components v a are given by 



where the unit vectors u' a (a = 1,2,3) are such that 



u' a ' Up — S a p ■ 



These vectors can be explicitly written in the form 1 



(5.7) 



w 1 ^ ^, 1 ^^ 1 _ , 
u 1 = — u 2 x u 3 , u 2 = — u 3 x u x , u 3 = — u l x u 2 , (5-8) 



For a dielectric crystal we simply have u' a = u . In the case of a vapor permeated by a 
magnetic field, it will be shown in Sect. 5.3 that u' = u* 
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where g = u, • [u 2 x u 3 ). 

The general solution to Eqs. (5.2) and (5.1a) in an anisotropic absorbing medium 
is quite complicated, except for some particular directions of propagation and po- 
larization. For instance, if we consider the wave 



E = B 



u n e 



27 ri^r (r 



■ Q—ct ) 



with 

n a = e a ( 5 -9) 

and 

u a ■ a = 0 

we easily see that Eqs. (5.2) and (5.1a) are satisfied. Obviously this is a very special 
case because the wave is polarized along one of the principal dielectric axes and it 
propagates along a direction perpendicular to the same axis. 

Instead of looking for a general solution to the problem of wave propagation 
within the medium, 1 we will confine ourselves to establishing an evolution equation 
for the components of the electric field. Our derivation will be further restricted 
to media having principal dielectric constants very close to unity. In other words, 
defining 

e a = l+L (a = 1,2, 3), (5.10) 

we suppose that |£ a | <C 1. 

Let us now consider a stationary plane wave of frequency v propagating along 
an arbitrary direction fi. The vectors E and D associated with the wave depend 
only on s (the spatial coordinate measured along O ) and on time t, thus Eq. (5.2) 
can be written in the form 



d 2 E 

ds 2 



d^(£.d) 



47T 2 ^ 2 



D . 



(5.11) 



Scalar multiplication by O gives 



D-n = 0, (5.12) 

so that the transversality condition (5.1a) is satisfied. These equations show that 
D is perpendicular to O while E has, in general, a non-zero component along the 
direction of propagation. 

Let’s now introduce two mutually orthogonal unit vectors el perpendicular to 
the direction of propagation, 

e*-e k =5 jk , S..fi = 0 (j,k = 1,2). (5.13) 



This is a classical problem in crystal optics and leads to the so-called Fresnel equation of 
wave normals (see Born and Wolf, 1964). 
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These vectors can be, for instance, the reference direction unit vector e a and the 
associated unit vector e b defined in Sect. 1.6 or, more generally, the complex unit 
vectors e + and e_ defined in Eqs. (1-41). Defining also 

e 3 = e 3 * = Cl , (5.14) 

we can expand the vectors E and D on the basis ( e. l , e 2 , e 3 ), 

£=E^’ ^ = E^> 

i i 

where 

£ i = eT • E , V t = e* • D . 

The relation between the components £ t of the electric field in the reference system 
(e) , e 2 , e 3 ) and the components £ a along the triplet (u 1 ,u 2 ,u 3 ) is the following 

£ i = E ' “a) £ oc > £ a = Y^(K-£i) £ i ■ ( 5 - 15 ) 

cx. i 

Analogous equations obviously hold for the components of the electric displace- 
ment. 

From the transversality condition (5.12) one obtains, using Eqs. (5.4), (5.5), 
(5.15), (5.14), and (5.10)' 

E [E (^3* ’ (K ' £ j) (! + O £ j = 0 ■ ( 5 - 16 ) 

j a 

On the other hand, if we expand any vector e • on the basis u a via Eqs. (5.6) and 
(5.7), 

£ j = E ' S’) “a > 

a. 

we obtain from Eqs. (5.13) and (5.14) 

E (3* ‘ “«) (**« ’ ? j) = 6 ij ’ ( 5 - 17 ) 

a 

therefore Eq. (5.16) can be written in the form 

1 + E ’ \) (*»« • e 3 ) £ a £ 3 = 

a 

= - [E (®3* • \) « • ei) - [E (e 3 * • “J (“a • e 2 ) £ Q £ 2 • 

a a. 

This formula shows that the ratio of the longitudinal component £ 3 of the electric 
field to the transverse components is of order £ a . Since we have supposed |£ a | <C 1, 
we have 



I £ 3 I ^ I £ i I (* — 1,2). 
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Consider now Eq. (5.11). Performing the scalar product by e* (j = 1,2) and 
using Eqs. (5.4), (5.5), (5.10), and (5.15), we get 1 



d% 4tt 2 p 2 

= V 

r 2 3 



ds 2 



4ttV 3 



k—1 cx 

The contribution from k = 3 has the form 
47T 2 P 2 



E E (®7 ' “«) (“a • 4) (! + D £ k (j = !» 2 ) • 



Ec 



e„- • tr 



) («C • 4) C 5; 



3 ! 



and since £ 3 is itself of order £ , this contribution can be neglected according to 
the assumption | £ a | <C 1. Up to hrst-order terms in we thus have 



d 2 £. 

-fa? 



47T 2 P 2 



E E ' 4J (“a • 4) (! + O 






k—1 (X 



To the same order of approximation it can be easily shown, with the help of 
Eq. (5.17), that this equation is equivalent to the following 



= E [E «‘4)( 1 +lQ 



4- 



(5.18) 



k= 1 oc 



The ± sign appearing in this equation is a consequence of the existence of progres- 
sive and regressive waves. If we choose to describe the temporal oscillation of the 
wave through an exponential of the form Q~ 2nwt (consistently with the convention 
of Sect. 1.3), we must take the positive sign. Since from Eqs. (5.9) and (5.10) we 
also have 

l+^a="o ( 5 - 19 ) 

Eq. (5.18) can be finally written in the form 



dS i 

-7T = 2ni 
os 



i ~ c E E ■ 4J («a • 4) n a £ k U = t 2 ) ■ 



/c=l a: 



(5.20) 



This equation expresses the variation along the propagation direction of the trans- 
verse components of the electric held. Its validity is limited to media whose prin- 
cipal dielectric constants differ slightly from unity and it cannot be applied, in 
general, to crystals. The same equation has been used (although without physical 
explanation) by Jefferies et al. (1989) in their derivation of the transfer equations 



l 



The scalar product by e 3 * leads to the trivial identity 0 = 0. 
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for polarized radiation from classical physics. A simpler, heuristic derivation, that 
is however basically inconsistent, has been given by Landi Degl’Innocenti (1992). 



5.2. Transfer Equations for Polarized Radiation 



Equation (5.20) describes the spatial evolution, along the ray path, of the transverse 
components of the electric field associated with a stationary plane wave of frequency 
v. This equation can be rewritten in the form 

£- s S j (8,t) = -Y / G jk £ k (8,t) (j,k = 1,2), (5.21) 

where 

G jk = “ 27ri ^ X) ' ®7) (**« ' ®fc) n a ■ (5.22) 

a 

The tensor G, which will be referred to in the following as the propagation tensor 
of the electric field, depends both on the physical nature of the medium, specified 
by the (complex) refractive indices n a , and on the geometry of the propagation 
specified by the two scalar products. It is worth noticing that in an isotropic 
medium, where n a = n, we get from Eq. (5.17) 

Gj k = -2ni- c n5 jk . (5.23) 

From Eq. (5.21) the transfer equation for the polarization tensor can be eas- 
ily derived. Recalling the definition of polarization tensor given in Chap. 1 (see 
Eq. (1.25) and its generalization in Sect. 1.8) we have 



d r J jk (s + As) - J jk (s) 

' J jk ~ lim 



ds 



As^O 



As 



(£*(s + As,t) £ k (s + As,t)) - (, £*(s,t)£ k (s,t )) 

= lim — 

As^O As 



ds 



ds 



and since the propagation tensor can be extracted from the statistical average 
implied by the symbol (•••), we obtain 



^- s J jk = -iZ{G* l Ji k + G u J jl ) (j,k= 1,2). (5.24) 

2= 1 

To derive the transfer equations for the Stokes parameters we need the relation 
between these quantities and the components of the polarization tensor. We recall 
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that this relation depends on the choice of the unit vectors e) and e 2 defined in 
Eq. (5.13). The easiest choice is 

e i = K > ®2 = 4i 

where e a is the reference direction unit vector and e b the associated unit vector 
(see Sect. 1.6). With this definition the relation between the Stokes parameters 
and the components of the polarization tensor is given by Eqs. (1.34), which can 
be inverted to give 

'hi = 2k ^ + ^ ^ 12= 2 

J22 = 2k (I ~ ^ 21 = 2 + ' 

Substitution into Eqs. (5.24) leads to the following propagation equation for the 
Stokes parameters 

(Vi V q V u Vv \ ( 1 

V Q Vi Pv -Pu Q 

Vu ~Pv Vi Pq U 

V Vv Pu - Pq Vi / \V 

where the seven independent quantities appearing in the propagation matrix are 
given by 

Vi = R e (Gh + G 22 ) 

V q = Re (^11 ~ ^ 22 ) Pq = — I m (G 1:l ~ G 22 ) 

Vu = Re (^12 + G? 2 i) Pu — (^12 + ^ 21 ) 

Pv = (G?i2 — G 21 ) py = Re (G 12 — G 21 ) . (5.26) 

Equations (5.25) and (5.26) are at the basis of radiative transfer for polarized 
radiation and deserve some discussion. First of all it should be emphasized the 
remarkable symmetry property of the 4x4 propagation matrix. This matrix is 
constructed with only seven independent quantities and can be decomposed into a 
diagonal matrix, proportional to rjj , and two off-diagonal matrices, one symmetric 
and the other antisymmetric about the main diagonal. 

Denoting the propagation matrix by K , we have 





K = 



rjj 0 0 0 \ 

0 ?7j 0 0 I 

0 0 ?7j 0 I 

0 0 0 77 j) 



/ 0 Vq Vu Vv\ 
Vq 0 0 0 

Vu 0 0 0 

\Py 0 0 0 / 



/0 0 0 0 \ 

0 0 Pv -Pu I 

0 - Pv 0 Pq I 

'0 Pu ~Pq 0 ' 



(5.27) 
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The first matrix is responsible for the absorption of the energy of the electromag- 
netic wave irrespective of its polarization state. This matrix would produce by 
itself an exponential decrease of the whole Stokes vector, so that the quantity ? 7 j 
can be regarded as the generalization to the polarized case of the absorption coef- 
ficient encountered in the standard theory of radiative transfer. In this respect it 
is worth noticing that for an isotropic medium we simply have from Eq. (5.23) 

v 

rjr = 471 — k , 
c 

k being the imaginary part of the refractive index which is connected with the 
standard absorption coefficient. This first matrix will be referred to in the following 
as the absorption matrix. 

As apparent from Eq. (5.23), both the second and the third matrix are zero for 
an isotropic medium. For reasons that will be clarified in the next section, these 
matrices will be referred to in the following as dichroism matrix and dispersion 
matrix, respectively. 

The expression that we have obtained for the propagation matrix K, and in par- 
ticular its symmetry properties, descend directly from the physical approximations 
adopted. First of all we have assumed the principal dielectric constants e a - which 
describe the dielectric properties of the medium - to be independent of the ampli- 
tude of the propagating electric field. This means that the polarization P induced 
in the medium is linearly related to the electric field itself. The present theory is 
therefore inadequate to treat the physical phenomena connected with non-linear 
optics. Moreover, we have assumed the principal dielectric constants of the medium 
to differ slightly from unity, so that the applicability of Eqs. (5.25)-(5.26) is limited 
to such media. Finally, we have supposed the electromagnetic wave to propagate 
in a homogeneous medium, whose properties do not depend on the spatial coordi- 
nate r. It can however be shown that for non-homogeneous media the equations 
now derived can still be applied provided that 



grad n a <C 



27T 

T 



a condition which is generally well-satisfied for optical radiation propagating in 
astrophysical or laboratory plasmas. 



5.3. Application to Magnetic Lines 



We now apply the results of the previous section to the case of an atomic vapor in 
the presence of a magnetic field. We describe the atom by the classical theory of 
the electron, going back to the oscillator model already used in Sect. 3.2. In the 
presence of a constant magnetic field B and of an oscillating electric field of the 
form 
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the motion of the electron is described by the equation 



d 2 x 

At 2 



= — 47T 2 Pg X — 



SlUxB 

me elf 



dx 
d t 



"0 ri —2m vt 

7 — E e 



£o 

m 



(5.28) 



where v 0 is the frequency of the oscillator and 7 is the damping constant. 

To find the dipole induced by the oscillating electric field associated with the 
incident electromagnetic wave we disregard the transient solution depending on 
the initial conditions and we look for a solution of the form 



— 27ri vt 

x = a e 



Substitution into Eq. (5.28) gives 

47 t 2 (pq — v 2 ) — 2m py 

where is the Larmor frequency defined in Eq. (3.10) and u 0 is a real unit vector 
directed along the magnetic held. 

Similarly to Sect. 3.2, we introduce the real orthogonal unit vectors u r , u s (such 
that (u r , u s , u 0 ) is a right-handed coordinate system), and their linear combinations 
u ±1 defined in Eqs. (3.25). Expanding the vectors a and E on the basis u a (a = 
0 ,± 1 ), 

a = > E = > 

a. a. 

one easily obtains, with the help of Eq. (3.26) 

a a =X a E a , (5.29) 



a — 87r 2 i a x u 0 = 



— - E 

m 



where \ a i s the electric susceptibility 



Xa = 



47 r 2 m ( i/q — v 2 ) — 2i vT — 2avv- L 



(a = 0, ±1) (5.30) 



with r = 7/47T. 

If there are N oscillators per unit volume, the electric polarization is 



P = —N e 0 a 



and the electric displacement is 

D = E + 47 tP = E — 47rfVeg a . 

Therefore the principal dielectric constants e a defined in Eq. (5.5) are given by 

1 



e =l + ^o 



7 rm (pq — p 2 ) — 2 q;pp l — 2 i vT 



(a = 0, ±1) . 
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Since both and r are usually much smaller than v 0 (see Sect. 3.2), the principal 
dielectric constants are sharp functions of the frequency v peaked at so that we 
can replace (i/q — v 2 ) by (i/ Q — v). 

If we now assume - consistently with the treatment of Sect. 5.2 - the principal 
dielectric constants to be close to unity, and separate the refractive indices n a in 
their real and imaginary parts, 



n a = ( 1 + S a ) + ik a 



(5.31) 



with S a and k a real, we obtain from Eqs. (5.9) and (5.19) 



Ne 2 


r 


Airmb 1 


(^o ^ ai/ L - v ) 2 + r2 


Ne 2 




Aiuni' 


- ais L - v) 2 + r 2 



(5.32) 



It should be emphasized that the basis u a ( a = 0, ±1) introduced above is such 
that the unit vectors u' a appearing in the expression of the propagation tensor G 
(Eq. (5.22)) are given by 

K = K f 

as it can be proved directly from Eqs. (5.8). As a consequence, the propagation 
tensor reduces to 

G jk = -^J2 C <*j C *a k n a , (5-33) 

Ot 

where C a j are the direction cosines defined in Eq. (3.31), and the completeness 
relation (5.17) becomes 

Y, = S H- ( 5 ' 34 ) 

a. 

Equations (5.33), (5.34), and (5.31) allow the coefficients of the propagation matrix 
K given in Eq. (5.26) to be written in the form 1 



Vl = ^^Y k a(\C al \ 2 + \C a2 \ 2 ) 

Ot 

VQ = 27T ~ c Y k a (IC'otI 2 “ \ C cc2\ 2 ) 

Ot 

% = 27r^fc Q 2Re(C al C; 2 ) 

Ot 

v v = ^^Y k ^ 2lm (C a iC* a2 ) 



1 Note that these expressions hold provided the unit vectors e i appearing in the direction cosines 
C j are the reference direction unit vector e a and the associated unit vector e b (see Sect. 5.2). 
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Fig.5.1. The direction of the magnetic field B is specified by the polar angle 6 and the azimuth 
angle reckoned from the reference direction e . In the plane perpendicular to B , the unit vector 
u r lies in the plane containing B and Q; u g is defined accordingly. 

^ = 2 ^EUicj 2 -icj 2 ) 

a 

Pu = 2n^S a 2Re(C al C: 2 ) 

a 

= I cr a2 ). (5.35) 

a. 

Thus in the case considered in this section (propagation of electromagnetic waves 
through an atomic vapor permeated by a magnetic held), the quantities ijq, 
r/jj, rj v depend only on the imaginary parts k a of the refractive indices, while Pq, 
Pjj, p v depend only on the real parts S a . More precisely, rjg, rftj, p v depend upon 
differences between the imaginary parts, as apparent from the fact that these quan- 
tities vanish when the three k a are equal (isotropic case) . Therefore the phenomena 
described by the second matrix in the expansion (5.27) consist in a differential ab- 
sorption of the different polarization states: hence the denomination of ‘dichroism 
matrix’ introduced in Sect. 5.2. By contrast, the third matrix in Eq. (5.27) is 
connected with differences between the real parts of the refractive indices, thus 
it describes phenomena associated with the dephasing of the different polariza- 
tion states in the propagation through the medium (hence the denomination of 
‘dispersion matrix’). 

The direction cosines C ai can be explicitly evaluated once the geometry of the 
magnetic field is specified. From Fig. 5.1 we easily obtain 

u r = cos 6 cos x e a + cos 8 sin x e b — sin 0 O 

u s = -sinx e a + cosx e b 

u 0 = sin 6 cos x? a + sin 8 sin x e b + cos 8 O , 



so that 



1 

72L 



(=F cos 8 cos x — i sin x) e a + (=F cos 8 sin x + i cos x) e b ± sin 8 O 
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Expressions for the bilinear combinations of the quantities C ai 





a = — 1 


a = 0 


a = +1 


\c al \ 2 + \c a2 \ 2 


\ (l + cos 2 0) 


sin 2 0 


\ (1 + cos 2 0) 


\c al \ 2 -\c a2 \ 2 


— ^ sin 2 # cos 2\ 


sin 2 # cos 2% 


— ^ sin 2 # cos 2% 




— ^ sin 2 # sin 2\ 


sin 2 # sin 2% 


— 1 sin 2 0 sin2x 


21m (C al C* 2 ) 


— cos 6 


0 


COS # 



The C ai combinations appearing in Eqs. (5.35) are summarized in Table 5.1. Using 
Eqs. (5.32) one finally obtains the following expressions for the elements of the 
propagation matrix 



Vi 

Vq 

Vu 

Vv 

PQ 

Pu 

Pv 



H£o w I 

me 2 
me 2 



H£o w I 

me 2 
7ren 1 
mc 2 

me 2 

nijvl 

me 2 

H£o w l 

me 2 




y^(! + cos 2 0) 
sin 2 0 cos 2% 
sin 2 0 sin 2% 



% 

% 

V’r 



^b + ^r 
2 

^b + ^r 
2 




cos 8 



sin 2 0 cos 2% 
sin 2 (9 sin2y 



where 



1 r 

n K + ^L “ v) 2 + r 2 
i r 



7T - v) 2 + r 2 

i r 

* H - ^l - u ) 2 + 






A = 



7 T K + ^L “ ^) 2 + ^ 2 
1 



(u 0 -u) 2 + r 2 



, 1 m — i/ 

V» r = - 



^ K - ^L - ^) 2 + ^ 2 



(5.36) 



(5.37) 



The functions <j) h , <j) , <j> T - where the subscripts stand for ‘blue’, ‘parallel’, and ‘red’, 
respectively - have already been encountered in Sect. 3.2 (see Eqs. (3.38)). They 
are Lorentzian profiles centered at the frequencies (v 0 + i/ L ), v 0 , and (y 0 — v L ), 
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Fig. 5. 2. Absorption profiles <J> and dispersion profiles tj) in units of F 1 , for v^j T = 5. 
respectively, and normalized to unity in frequency, 



oo 




— oo 



The functions ip h , ij > p , ip T are the corresponding dispersion profiles. An illustrative 
example of such profiles is shown in Fig. 5.2. These profiles can be easily generalized 
to include the effect of microscopic velocity fields that may be present in the medium 
(see Sect. 5.4). 

An important extension of the theory presented in this section concerns the 
inclusion of the emission term into the propagation equation. This term has already 
been derived in Sect. 3.2, although in a slightly different geometry (see Fig. 3.3). 
For the geometrical configuration of Fig. 5.1 the expressions (3.39) for the direction 
cosines must be replaced by corresponding expressions which lead to the bilinear 
combinations of Table 5.1. The emission coefficient becomes accordingly 








<£b + 



2 



(1 + cos 2 6) 
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£ n = k - 

Q 2 



£ u = k 2 



sin 2 # cos 2\ 
sin 2 9 sin 2\ 



£v ~ k 2 



where 



— <j> h cos 9 , 



k = - 

m n 



(5.38) 



me c * 

E being the mean energy contained into each degree of freedom of the classical 
oscillator. 

Adding to Eq. (5.25) the contribution due to emission, we obtain the radiative 
transfer equations for polarized radiation in the form 



(5.39) 







Q 




u 




\vJ 





Vi 


9q 


Vu 


Vv \ 


f 1 - 3 


9q 


Vi 


Pv 


~Pu 
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Vu 


~Pv 


Vi 


PQ u 


Vv 


Pu 
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where the scalar source function S is given by 

5 



(5.40) 



In particular, under the hypothesis of Local Thermodynamic Equilibrium (LTE) 
the mean energy E is given by k B T, where k B is the Boltzmann constant and T 
the absolute temperature, so that 

S - ^2. k T 

a LTE c 2 "'B 1 i 

which is the classical expression for the Planck function B P in the limit h — » 0 
( approximation of Rayleigh and Jeans). 

Obviously, we cannot expect to find by a classical theory the exact expression 
for the Planck function, which is intimately connected with quantum concepts. 
However, using a semiclassical approach, we can go back to the quantization of the 
harmonic oscillator (see Sect. 4.1) and write 



E ne ~ 



E = hv n 



X 1 

e — 1 



where x = hv 0 /k B T, and substituting this result into Eq. (5.40) we obtain 



Ete — 



e*B T -1 
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an expression that could also be obtained by invoking the Kirchhoff law for the 
emission of a source in LTE. 

Equation (5.39), with the matrix coefficients given by Eqs. (5.36) and with 
the Planck function in the place of the source function, was first obtained by 
Unno (1956) by a heuristic approach which neglected anomalous dispersion effects 
(pq = Pu = p v = 0). These effects were subsequently introduced by Raclrkovsky 
(1962a) through classical considerations, similar to those presented in this section, 
based on the refractive index. Another derivation has been recently presented by 
Jefferies et al. (1989). In the meantime a quantum derivation of the same equation 
was given by Landi Degl’Innocenti and Landi Degl’Innocenti (1972), who pointed 
out some minor errors or ambiguities (mainly connected with sign conventions) ap- 
pearing both in the original papers and in subsequent papers based on those (like 
for instance in Beckers, 1969a). The reader is referred to Rees (1987) for a clear 
presentation of the possible errors that may arise in the derivation of Eq. (5.39). 



5.4. The Voigt Function and the Associated Dispersion Profile 

In the previous section we have derived the absorption profiles <f> h , <p p , <j) T and the 
corresponding dispersion profiles i/j h , i/j p , ij) T for a collection of atoms supposed at 
rest in the laboratory frame (the frame where the Stokes parameters are defined). 
These profiles are of the form p {v 0 — v), where is the frequency of the classical 
oscillator. 

Actually, however, the atoms will always be in motion. In order to realize how 
the profiles are affected by these motions, let us consider a radiation beam of 
frequency v propagating along a given direction, which we identify with the line of 
sight. If w is the velocity component of the atom along the line of sight, the atomic 
frequency is shifted, according to the classical formula of the Doppler effect, to 
the new value 




where we have assumed i»«c (non-relativistic approximation) and where we have 
adopted the sign astroplrysical convention according to which w > 0 when the atom 
is receding from the observer. 

Therefore, for a collection of atoms having a normalized distribution of velocity 
components f(w), the profile p[y 0 — v) must be replaced by the expression 



OO 

J P( u i o - ^07 - v ) /M dw • 



(5.41) 



In many cases of astroplrysical interest the velocity w can be decomposed in two 
parts: the bulk (or macroscopic) velocity w A of the ambient medium, plus a random 
velocity due to thermal or microturbulent motions, usually distributed according 
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to the Maxwellian law and characterized by the velocity ie T . We thus assume for 
f(w) the following expression 



/ M 




(5.42) 



Substituting Eq. (5.42) into Eq. (5.41), and introducing the reduced variables 



. iv T 

Av d — v 0 — 



a = 



r 

Av r 



v„ = 



B Av d 



Va = 



W A _ v 0 w A 



w T c Av u ’ 



v = 



Av t 



one obtains with easy transformations 

1 



V’a = 



a/k Ai 
1 

yjn Ai 



H { v — v A + a v B , a) 



■ L(v — v A + av B , a) , 



(5.43) 



(5.44) 



where a = — 1,0, +1 for the ‘red’, ‘parallel’, and ‘blue’ component, respectively, 
and where we have defined the functions 



H(v, a) 
L[y , a) 




(• v - y ) 2 + a 2 d{J 
(v - y ) 2 + a 2 y 



(5.45) 



In the above expressions Av B represents the Doppler width in frequency units, 
and it is used to normalize all the other quantities: v is the so-called reduced 
frequency, v B the normalized Zeeman splitting, v A the normalized shift due to the 
bulk motion, and a the damping constant. Figure 5.3 shows the two sets of profiles 
(f) and if. 

The functions defined in Eqs. (5.45) are the so-called Voigt function H{v,a ) and 
the associated dispersion profile L(v,a). In previous works on this subject the 
function L(v, a) has been usually written in the form 

L(v, a ) = 2 F(v, a) , 



and the name of Faraday- Voigt function has sometimes been employed for F(v, a). 

It should be remarked that the quantities defined in Eqs. (5.43) can be expressed 
in terms of wavelength (instead of frequency) displacements and broadenings. In- 
troducing the Doppler width in wavelength units, 



. , Aq Wrp 

ZiAn = AlSr, = A n 



C 



(5.46) 
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-10 -5 0 5 10 



Fig. 5. 3. Absorption profiles O and dispersion profiles A in units of ZW D 1 . corresponding to v B = 3, 
v A = 0, and a = 0.05. Note that the magnetic field is 12 times stronger than in Fig. 5. 2. 



Eqs. (5.43) can be written in the form 

A -An 



Z1A d ’ 

; Agp L 
c A\A n 



<.=411 

c Z\A D 

A o e 0 B 
47rmc 2 Z\A n 



= 



C Z\Ar, 



A\ r 

~A\ t 



(5.47) 



where A\ B is defined in Eq. (3.13). With these positions Eqs. (5.44) are still valid. 1 

The thermal velocity w T in the above equations can be related to the kinetic 
temperature T and to the microturbulent velocity £ by the expression 



IMbT , c . 






(5.48) 



1 Note, however, that the profiles <j> defined in Eqs. (5.44) are normalized to unity in frequency. 

The corresponding profiles, normalized to unity in wavelength, are obtained by substituting Z\A D 
for in the same equations. 
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where p is the atomic weight of the atom which originates the spectral line and M 
is the mass of unit atomic weight. Therefore, the normalized Zeeman splitting v B 
can be expressed in the form 

v _ \) e o B 1 

or, numerically 

v B = 1.400 x 10" 7 A 0 B 1 . (5.49) 

J 1.663 x 10- 2 | + C 2 

where A 0 is in A, B in G, T in K and £ in km s -1 . 

In the following of this section we discuss the main mathematical properties of 
the functions H(v, a) and L(v, a) defined in Eqs. (5.45). These properties are easily 
obtained by considering, rather than the separate functions, their complex linear 
combination 

W(v, a) = H(v, a) + iL(v, a) . (5.50) 

Introducing the complex variable z defined as 

z = v + i a , (5.51) 

we obtain with easy transformations 

OO 2 

i r e~ v 

W{z) = - / d y, (5.52) 

7i J z-y 

— OO 



where, being a > 0, we have Im z > 0. The function W(z) is an analytical function 
of the complex variable z that can be related to the complementary error function 
(see Abramowitz and Stegun, 1965). 

By the substitution (z — y) = u we first transform the integral in Eq. (5.52) into 
the following 

2 „ -m 2 +2 uz 

W(z) = — e / d u , 

7T Jl U 

where L is a straight line parallel to the real axis in the half plane Im u > 0 (see 
Fig. 5.4). 

Next we observe that 



_d_ 

dz 



-u-\-2uz 



du = 2 / e u +2uz du = 



2e“ J e (u du = 2v / 7re" 



whence we obtain by integration 

P -u 2 -\-2uz 



du = C + 2 \pK d t . 



(5.53) 
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Fig. 5. 4. Integration paths L and L' in the plane of the complex variable u. 

where C is a constant that can be determined by taking the limit z — > 0 of Eq. (5.53) 

— u 2 +2 uz r — u 2 

d u= [ - d-u . 

U J L U 

The last integral can be evaluated by observing that the integration variable u has 
a pole at the origin. Deforming the integration path L into the path L' shown in 
Fig. 5.4, the contributions arising from the integration along the real axis cancel 
out, while the contribution from the semicircle around the origin simply gives — hr. 
To conclude, we have obtained that the function W (z) can be written in the form 

Z 

1 + f e* dt =e erfc(— iz) , (5.54) 

v 71- J 

o 

where erfc ( x ) is the complementary error function defined as in Abramowitz and 
Stegun (1965). 

From Eqs. (5.52) and (5.54) some important properties of the functions H(v,a ) 
and L(v, a) can be easily derived: 

i) Limiting case a = 0. 

For a = 0, being z = v, Eq. (5.54) gives 

, -V 2 2i -v 2 f t 2 ^ 

W(z)=e H — -= e /e d t, 

V 77 J 

o 

thus from Eq. (5.50) one gets 



W(z) = e 




H(v, 0) = e 



L(v, 0) = -^=D(v) , 



(5.55) 
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where D(v ) is the so-called Dawson integral 



D(v) = e v Je* d t . 

o 



ii) Limiting case v = 0. 
Being now z = ia we get 



so that 



W(z) = e c 



1 + 



2i 



d t 



if (0, a) = 



L(0, a) = 0 . 



a 




0 




(5.56) 



iii) Asymptotic expansion for large values of the argument: ( v 2 + a 2 ) 1. 

In Eq. (5.52) one can write 



therefore 



1 



z-y 





OO 



E 




W(z) 




— OO 



-y 



d y 



. OO 




r(n + 1/2) 

^2n+l 



where T(a;) is the Euler Gamma function. By taking the leading terms of the 
expansion in inverse powers of 2 one easily obtains the asymptotic behavior 



H(v, a) 
L(v, a) 



1 a 

ypf a 2 + v 2 
1 v 

ypK a 2 + v 2 




3v 2 — a 2 
(a 2 + v 2 ) 2 + " ’ 



1 v 2 — 3a 2 
+ 2 (a 2 + v 2 ) 




(5.57) 



iv) Derivatives. 

From Eq. (5.54) we have 

dW(z) 

dz 



-2 zW(z) + , 
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from which we deduce 



— — (H + i It) — — i 7 — (H + i L ) — — 2 (u + i a) ( H + i L ) H — — , 
dv da v 7r 



whence 



dH(v,a) dL(v,a) 
dv da 

dH(y,a) d L(v,a) 



da 



dv 



= 2 
= 2 



— uif (u, a) + aL(v, a) 



-= + aH (v, a) + vL(v, a) 

\/7T 



(5.58) 



a set of formulae which allows to express the derivatives of H{v,a ) and L(v,a) in 
terms of the functions themselves. 



v) Power series expansion. 

For a <C 1, a useful series expansion can be easily derived from Eqs. (5.58). We 
give here the expansion up to the fourth order in powers of a 



H(v, a) = e v + ^2 vD{v) — 1^ + a 2 ^1 — 2v 2 ^ e 



+ — a v 2 — 1 + u(3 — 2 v 2 )D(v) + — (3 — 12u 2 + 4 u 4N ) e v + ■■• 

3 v 7 r L J 6 V / 

L(v, a) = -^= D(v) — 2ave v H — \ =a 2 v + (1 — 2v 2 )D{v) — -a 3 v( 3 — 2v 2 )e v 

V 7T Jtt L 13 



1 



;(5 - 2u 2 ) + (3 - 12 c 2 + 4c 4 )D(c) 



vi) Symmetry properties. 

From Eqs. (5.45) the following symmetry properties are easily derived 

H{—v,a) = H{y,a), L(—v,a) = —L(v,a), (5.59) 

which means that H is symmetrical in v, while L is antisymmetrical. 

vii) Integral properties. 

From Eq. (5.52), integrating in the complex plane 2 along the real axis R, and 
using the residue theorem, we obtain 



W(z) dz = — / dy e 

' 7T ./ 



- U 2 /' 1 



dz = I e y dy = , 



Ir z ~ y 



whence 



OO OO 

J H ( v , a) dc = \pK , J L(v, a) dv = 0 . 



(5.60) 
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This formula proves that the profiles tj> a in Eqs. (5.44) are normalized to unity in 
frequency. 

viii) Convolutions. 

Consider the integral 



J W(v — v 0 , a) W ( v — Vq , a') dv . 

— OO 

Using Eqs. (5.51)-(5.52) and inverting the order of the integrals one gets 

OO 

J W(v — v 0 , a) W(v — Vq, a') dv = 



= y / d ye d ye 



/ 



1 



7T 



(v — v 0 — y + i a) (v — v' 0 — y' + i a') 



du . 



The last integral can be performed via the residue theorem. Since both poles lie in 
the half plane Im v < 0, one obtains 



J W{v — Vq , a) W (■ v — Vq , a') dv = 0 . 

— OO 

Similarly, 

OO 

J W(v — v 0 , a) W(v — v' 0 , a')* dv = 



(5.61) 



1 



dy e v I d y e 



/ - v 



1 



- 



J (v - Vq - y + 1 a) (v - v' 0 - y' - i a ') 

— OO — OO — OO 

and using again the residue theorem we obtain 



OO 

J W(v — v 0 , a) W(v — v'q , a')* dv = 



2 i 



= — / d ye / dy e 



/ ~y 



v o ~ Vq + y' - y + i (C + a) 
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The double integral can be evaluated by the substitution 






x ' = -js (y + v') . 



which leads to 



OO 

J W(v — v 0 , a) W{v — v' 0 , a)* dw = 



= — [ e x ' 2 dx 1 [ 

7T J J v' 0 — v 0 — V2 x + i (a' + a) 



dx . 



Performing the integral in da/ and recalling Eq. (5.52), one finally obtains 

OO 

J W(v — v 0 ,a) W(v — v' 0 , a')* cle = \[^k W ^ °^ / _ 0 , 

— OO 

Substitution of Eq. (5.50) into Eqs. (5.61) and (5.63) yields 



J H {v - v 0 , a) H{v — v' 0 , a’) dv = H ( j° 



J L{v - v 0 , a) L(v - v' 0 ,a')dv = ^ H ( ^° ^° , 

— OO 
OO 

j L{v - v 0 , a) H(y - v' 0 , a ;') dv = L 



These formulae can be easily generalized to more complicated convolutions, 
sidering the integrals 



W { V -^,a]w( V -^,a') du 



and 






(5.62) 



(5.63) 



(5.64) 

Con- 



— OO 
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we obtain with similar manipulations 1 



H 



i a) H 



"o / 



b ' ’ 



du = 



bb' t/tv f v' 0 — v 0 a'b' + ab 

U 1 



Vb 2 + b' 2 \Vb 2 + b /2 ’ Vb 2 + b' 2 



v - v n 



a I L 



v — v: 



o j 



b ' 



, a du = 



bb 1 f v' Q — v 0 a'b' + ab 

H 



V 7 b 2 + b' 2 \Vb 2 + b’ 2 ’ \/b 2 + b' 2 



a H 



o j 



b' 



du = 



bb'^n f v ' 0 — v 0 a’b’ + ab 



Vb 2 + b' 2 \Vb 2 + b 12 Vb 2 + b 12 



(5.65) 



ix) Maxima and minima. 

The function H(v, a), considered as a function of v, has a single maximum at v = 0. 
The maxima and minima of the function L(v, a) with respect to v can be found by 
solving the implicit equation (see Eqs. (5.58)) 

a H (v 0 , a) + v 0 L(v 0 , a) = ~^= . 

v 7r 

Using numerical methods, the value of v 0 is found to be 0.9241 for a = 0, and to 
increase almost linearly to 1.5372 for a = 1. For larger values of a, v 0 converges 
asymptotically towards a as it can be deduced from Eq. (5.57). Obviously, there is 
a corresponding minimum at v = — v 0 . 

x) Numerical methods. 

Several algorithms have been proposed for the numerical computation of the func- 
tions H(v, a) and L(v, a). The most extensively used for astrophysical applications 
are those by Reichel (1968), Hui et al. (1978), and Humlicek (1982). The values 
obtained using these algorithms can be checked against the tabulated values of the 
complex function W{z) given by Faddeeva and Terent’ev (1961). 



Transformation (5.62) should be replaced by the following 

T = by ~ b'y' ^ = b'y + by' 

X v b 2 + b ' 2 ’ X Vb ’ 2 + b ' 2 
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5.5. Symmetry Properties of the Transfer Equations 
for Polarized Radiation 

In Sect. 5.2 we have derived, in the framework of classical electrodynamics, a 
compact expression for the propagation matrix K which describes the transfer of 
polarized radiation, and we have seen that this matrix satisfies the remarkable 
symmetry properties summarized in Eq. (5.27). It will be shown in Sect. 6.7 that 
the quantum-mechanical treatment leads to the same symmetry properties, so that 
it seems natural to ascribe them to general physical principles. Following Landi 
Degl’Innocenti and Landi Degl’Innocenti (1981) we present here a series of argu- 
ments leading to an alternative proof of Eq. (5.27). 

Consider a radiation beam of frequency v, characterized by the Stokes parameters 
I, Q , U, and V, propagating through an arbitrary medium within the solid angle 
clfi in the direction 0. By analogy with the usual transfer equation for unpolarized 
radiation we can write for the Stokes parameters a transfer equation of the form 

d 3 

■^Si = -J 2 K ij S j +£ i (?’ = 0, . . . , 3) , (5.66) 

j=o 

where S is the Stokes vector defined by 

S^ = (S 0 ,S 1 ,S 2 ,S 3 ) = (I,Q,U,V), 



K is a 4 x 4 matrix which is composed in principle of 16 independent quantities, 
and e is the emission vector 

et = ( e 0> e l; e 2> e 3) = ( £ n £ Qi £ U’ £ v) ■ 

We assume here that both K and e do not depend on the Stokes vector S. This 
may indeed appear as a rather restrictive assumption, because there are many 
physical situations, mainly of astrophysical interest, where the properties of the 
medium at any given point depend in fact upon the local radiation field. It should 
be remarked, however, that the angular spread of the radiation beam that we are 
considering can always be made sufficiently small for the properties of the medium 
to be not affected by the beam itself. Keeping this argument in mind, we can 
derive some important properties of K and e by varying arbitrarily - in a sort of 
Gedankenexperiment - the value of the Stokes vector S appearing in Eq. (5.66). 

If we start by considering the limiting case of very intense radiation beams, the 
contribution of the emission vector in Eq. (5.66) can be simply neglected and the 
transfer equations reduce to 

<i = ° 3), 

3=0 



(5.67) 
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from which we deduce 

1 2^- Q2 - U2 - V2 ) = 

= ~K 00 I 2 + K ±1 Q 2 + I< 22 U 2 + I< 33 V 2 

+ (K 10 - K m )IQ + (K 20 - I< 02 )IU + (K 30 - I< 03 )IV 

+ (K 12 + K 21 )QU + (K 13 + K 31 )QV + (I < 2 3 + K 32 )UV . (5.68) 

As proved in Chap. 1 (see Eq. (1.47)), the quantity ( / 2 — Q 2 — U 2 — V 2 ) is always 
positive, except for the case of a totally polarized radiation beam when it reduces 
to zero. Therefore, if we consider a totally polarized radiation beam, 

Si = Qqi U 0 , V 0 ) with I 0 = Q 0 + U 0 + V 0 , 
the right-hand side of Eq. (5.68) must be > 0, 

(-^11 — Koo)Qo + (^22 — ^ 00)^0 + (-^33 — ^ Oo )^0 

+( K 10 - ^oi)^oQo + (-^20 — Ko 2 )IoU 0 + (K 30 — I< 03 )l 0 v 0 

+ (K 12 + K 21 )Q 0 U 0 + (K 13 + K 31 )Q 0 V 0 + (I< 23 + K 32 )U 0 V 0 > 0 . (5.69) 

At this point we make a distinction between depolarizing and non- depolarizing 
media according to whether the greater-than or the equal sign holds in Eq. (5.69). 
In a non-depolarizing medium, by definition, a beam which is totally polarized 
remains so in the process of propagation (provided that no other terms are added 
to the right-hand side of Eq. (5.67)), while in a depolarizing medium this feature 
is lost and the propagation shows a typical irreversible character. 

For non-depolarizing media - the only case that will be considered here - equality 
(5.69) must hold for whatever realization of the Stokes vector S 0 , or, in other words, 
for any values assigned to Q 0 , U 0 , V 0 satisfying the condition /q = (Qg + t/g + Eg 2 ). 
Therefore all the brackets in the left-hand side of Eq. (5.69) must vanish, which 
implies 



^00 — -^11 — K ' 2'2 ~ -^33 



K — K 
Ji oi ~ Ji 10 ) 



K — K 
Ji 02 ~ n 20 



K — K 

n 03 ~ Jv 30 



K — - K 
ly 12 /v 21 ) 



K — — K 
13 /v 31 ’ 



K — - K 

ly 23 — /l 32 ' 



These are just the symmetry relations contained in Eq. (5.27). In the light of the 
arguments presented above we see that these relations are a direct consequence 
of: a) the definition of the Stokes parameters (which implies the inequality I 2 > 
Q 2 T U 2 T V 2 ); b) the linear character of Eq. (5.67) (which is equivalent to the 
assumption K independent of S ); c) the reversible character of the interaction of 
radiation with the ambient medium (which implies the equal sign in Eq. (5.69)). 
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When conditions b) and c) are fulfilled the propagation matrix K can be written 
in the form 



(Pi 




Pu 


Pv 


? ?Q 


Pi 


Pv 


- Pu 


w 


- Pv 


Pi 


pq 


Pu 


-pq 


Pi 



We will now prove that one, or even two, of the seven independent coefficients 
appearing in the expression of K can be set equal to zero by an appropriate choice 
of the reference direction which defines the Stokes parameters. Recalling the results 
derived in Sect. 1.9 (see Eq. (1.45) and Fig. 1.10), the Stokes vector S' relative to 
the ‘new’ reference direction characterized by the angle a can be written in the 
form 

S' = 71(a) S , 



where 



/I 0 0 0\ 

0 cos 2a sin 2a 0 1 
0 —sin 2a cos 2a 0 I 
VO 0 01/ 



Thus in the new reference frame the expression for the propagation matrix is 



K' = R(a)KR~ 1 {a) , 



whence we deduce after some easy algebra 
p'i = P i > Pv = Pv > Pv = Pv 

ij'q = cos 2a t]q + sin 2a p v , Pq = cos 2 a pq + sin 2a p v 

p'jj = — sin 2a pq + cos 2a 7] v , p' v = — sin 2a pq + cos 2a p v . (5.70) 

These formulae show that while the quantities r/j, p v , and p v are invariant with 
respect to the choice of the reference direction, ?y q and p v change into each other 
according to a rotation through an angle 2a in the Pq-rfrj plane, and the same 
happens for pq and p (J . It follows that the two quantities p h and p L defined by 

? ?l ~ V q + Vu ’ Pl — Pq + Pu 

are invariant. Moreover, it is possible to choose a particular reference direction, 
specified by the angle a 1 , such that p' lT is zero, and another reference direction, 
specified by the angle a 2 , such that p'jj is zero. The angles a 1 and a 2 may be 
different in principle, but in most cases of interest they are found to be equal. 
When this happens we will call preferred reference direction the one for which p'jj 
and p' lT are both zero. 

Obviously, the expressions derived in Sect. 5.2 for the elements of the propagation 
matrix K are consistent with the transformation law given in Eqs. (5.70). This 
can be proved directly from the general equations (5.26) by taking into account 
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the transformation properties of the unit vectors e a , e b under rotation of the ref- 
erence direction. The proof is straightforward and will not be given here. In the 
special case of the propagation of radiation in a magnetized medium (Sect. 5.3) one 
can easily see from Eqs. (5.36) that the preferred reference direction lies (indepen- 
dently of frequency) in the plane containing the magnetic field and the direction 
of propagation. 1 

The above remarks show that when a preferred reference direction can be defined 
the propagation matrix K can be cast into the simpler form 



(V! 




0 


Vy 


? ?Q 


Vi 


Pv 


0 


0 


- Pv 


Vi 


PQ 


w 


0 


~PQ 


Vi 



Going back to Eq. (5.66), we can also deduce some interesting properties of the 
emission vector e by considering the limiting case of very weak intensity of the 
radiation beam. Under this limit the transfer equations reduce to 



d Si 
d s 



= 



(/ =()...., 3 ) . 



and by integration over the interval As one gets 



S)(s + Z\s) = ,St( s ) + e . As . 

Therefore, if we suppose <S)(s) = 0 we obtain that the following inequality must be 
satisfied 

£ i ^ \/ £ q+ £ u + £ v ■ ( 5 - 71 ) 

In the case of the propagation of radiation through a magnetized medium this 
relation can be proved directly from Eqs. (5.38). 

It should be noticed that the corresponding inequality for the components of the 
diclrroism matrix, namely 



Vi > \J Vq + Vu + Vv - ( 5 - 72 ) 

does not hold in general. However, under Local Thermodynamic Equilibrium or, 
more generally, when one can write 



( £l 

e Q 



= s 




inequality (5.72) follows directly from (5.71). 



(5.73) 



1 



Indeed, there is another preferred reference direction at 90° from the former. 
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Another property of the emission vector concerns its transformation under a 
rotation of the reference direction. It is easy to prove that the rotation leading to 
Eqs. (5.70) induces the following transformation on the components of the emission 
vector 

£ i 

cos 2a £q + sin 2a e v 
— sin 2a Eq + cos 2 a £ n 

e v ■ 

These formulae show that the quantity e L defined by 

2 2,2 
£ L = e Q+ £ U 

is invariant, and that it is possible to choose a particular reference direction, charac- 
terized by the angle a 3 , such that e' v is zero. When a preferred reference direction 
can be defined ( 77 ^ = p' v = 0) it often happens that e' v is zero as well. 




5.6. Geometrical Interpretation of the Transfer Equations 
for Polarized Radiation 



In Sect. 1.9 we have introduced the concept of Poincare sphere as a mapping be- 
tween Stokes vectors and points within a sphere of unit radius (see Fig. 1.11). 
Thus it is quite natural to give a geometrical interpretation to the radiative trans- 
fer equations for polarized radiation in terms of the motion of the representative 
point within the Poincare sphere. This is schematically illustrated in Fig. 5.5. 

In a right-handed orthogonal system ( xyz ) we consider the formal vectors p , fj , 
p , e defined by 



_ (Q U V 

p= VT’T’T 

V= (Vq,Vu’Vv) 

P = (pq> Pu> Pv) 

=*_ £_ ffQ e JL £ -V_\ 

I V I ’ I ’ I ) ’ 



and the scalar quantity 




With these positions the transfer equations for polarized radiation can be trans- 
formed into the following 



d I 

ds 

dp 

ds 



—fj + (fj ■ p) p + p x p + e — Cjp . 



(5.74a) 



(5.74b) 
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Fig. 5. 5. The transfer of polarized radiation in an arbitrary medium can be described in terms of 
the motion of the representative point inside the Poincare sphere. The point cannot pierce the 
surface of the sphere but may describe a curve that is tangent to it, like in the point P. The 
starting point P Q represents the radiation entering the medium. 

The scalar equation describes the transfer of the beam intensity and it can be 
considered as the equation which controls the scaling factor connecting the Stokes 
parameters with their fractional value given by the vector p . It is interesting to 
note that this equation differs from the standard equation for unpolarized radiation 
only for the presence of the term (if • p) which adds to the (generalized) absorption 
coefficient r/j. Note also that the dispersion term, p, does not appear directly 
in this equation, nevertheless it affects the intensity via the term (rf • p ) (since p 
depends on p). 

The vector equation describes the motion of the representative point inside the 
Poincare sphere. The first term in the right-hand side tends to align the fractional 
polarization p with the vector —if . If we consider for instance the simple case 
where rf does not depend on s, and if we take p = 0 (unpolarized radiation) as the 
boundary condition at s = s 0 , when the ray has traveled a distance As from s 0 we 
obtain from this term 

p(As) — —if As , 

and, as As grows, the representative point might pierce the Poincare sphere. This 
is however prevented by the second term, which is opposed to the first one and 
whose relative importance grows as the point approaches the surface of the sphere. 
Clearly this term describes a phenomenon of saturation for the fractional linear 
polarization. 

In this respect it is also interesting to consider the effect of the term (if ■ p) 
in the scalar equation for the intensity. In the simple situation discussed above 
(if = const., p = 0 at s = s 0 ) this term brings a negative contribution to the 
absorption coefficient, since p is directed along —if. Therefore, as the radiation 
beam propagates through the medium, the polarization grows and the medium 
becomes more and more transparent. We are facing again a saturation phenomenon 
which is strictly connected with the former. 
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(a) (b) 

Fig. 5. 6. Geometrical interpretation of Faraday rotation (a) and Faraday pulsation (b). In case 
(a) only the component p v is non-zero and we have a transformation from Q t o U . and vice versa. 
In case (b) the component />, , induces a transformation from Q to V, and vice versa. 

As an illustrative example, let us consider a low-temperature medium (i.e. a 
medium with negligible emission) which is opaque to left-handed circular polariza- 
tion, and suppose that an unpolarized radiation beam is entering the medium at 
s = s 0 . During the propagation, the left-handed circular polarization will decrease 
exponentially, so that right-handed circular polarization only will remain in the 
beam and the medium will be totally transparent. 

Let’s now consider the third term in the right-hand side of Eq. (5.74b). This term 
causes a precession of the vector p about the vector p ; in this motion the absolute 
value of the fractional polarization is unchanged, while the type of polarization 
changes. In particular, the 2 -component of the vector p, p v , induces a rotation of 
the representative point about the 2 -axis, which implies a rotation of the direction 
of maximum linear polarization. The x and y components (pg and pjj) give rise to 
a rotation about an axis belonging to the x-y plane, which implies a transformation 
from linear to circular polarization and vice versa. These two phenomena, which 
are illustrated in Fig. 5.6, will be referred to in the following as Faraday rotation 
and Faraday pulsation, respectively, although these names should more properly be 
employed when dealing with the propagation of polarized radiation in a magnetized 
medium far from resonances. 

An important property of the term (p x p) is that it can be efficient in removing 
the alignment between the vectors p and —if that tends to arise in the schematic 
situation discussed previously. Thus the presence of dispersion reduces the impor- 
tance of the saturation mechanisms that we have just pointed out. The only case 
where the effects of dispersion can be neglected is when the three formal vectors if , 
p , and e point in the same direction. In this case, if we add the further condition 
that the radiation beam is unpolarized when entering the medium, the vector p 
will also be aligned with the same direction, and the term (p x p) vanishes. 
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Comments similar to those on the terms —ff and ( ff ■ p)p can be made for the 
last two terms in Eq. (5.74b), e and — e 7 p. The fourth term tends to align the 
polarization p with e, but when p starts growing the other term comes in and 
prevents, through a saturation mechanism, the possibility of piercing the surface 
of the Poincare sphere. 

Finally, let us consider the special case described by Eq. (5.73) (which includes 
the LTE case). Defining the reduced scalar source function s as 

_ S 
s = — , 



we have from Eq. (5.74b) 

d p _ . _ 

— = -(1- s)ri+ (p ■ p- srj I )p + px p , 

an equation containing the driving term, the saturation term, and the dispersion 
term. 



5.7. Resonance Scattering and the Hanle Effect 

We have seen in Sects. 5. 1-5. 2 how it is possible to deduce the transfer equations 
for polarized radiation using the concepts of classical electrodynamics. The general 
equations (5.25)-(5.26) have then been applied to the case of an atomic vapor 
embedded in a magnetic field. By schematizing the atomic system as a negative 
charge oscillating under the action of an elastic, restoring force, we have derived 
Eqs. (5.36) which give the basic quantities describing the transfer of polarized 
radiation in a magnetic field. 

In this section we will use the same atomic model to deduce the laws of resonance 
scattering and of the Hanle effect. Starting from Eq. (3.34), we can write for the 
frequency-integrated radiation emitted per unit time in the solid angle dO by an 
atomic oscillator embedded in a magnetic field 

dl ij (n)=£j ij (r)r 2 dn 

oo 

= d0 E C «i C !V ^ W du , (5.75) 

a P -oo 

where the symbols have the same meaning as in Sect. 3.2. 

To describe a scattering experiment we will now assume the initial values of 
the amplitudes, A a , to be proportional to the corresponding components S' a of 
the resonant electric field impinging on the oscillator from the direction f V (see 
Fig. 5.7). Note that the only difference between the present case and the case 
considered in Sect. 3.2 lies in the assumption on the amplitudes A a . To obtain 
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Fig. 5. 7. A radiation beam is scattered from the direction f!' to the direction f! in the presence of 
a magnetic field. 



the emission in the Zeeman effect we assumed the amplitudes to be completely 
uncorrelated, 

A* a A 0 = Ws a p; 

by contrast, we now assume 

A* a A 0 = K£':£' 0 <m' , (5.76) 

where clO' is an infinitesimal solid angle around the direction O'. Note that the 
frequency dependence of the components £' a in the above equation is not specified: 
they are supposed to be constant over a spectral interval centered at the reso- 
nance frequency p 0 and much larger than the Larmor frequency (flat spectrum 
approximation) . 

If we now introduce two mutually orthogonal unit vectors perpendicular to the 
direction O', 

e'-O' = 0 (i,j = 1,2), 

we can write 

= (5.77) 

i 

where the direction cosines Cl fi are given by 

C' 7i = S • ef . 

Substituting Eq. (5.77) into Eq. (5.76), and averaging over all the possible realiza- 
tions of the incident electric field, we have 

A « A P = T E (^) dfi ' - (5- 78 ) 

ij 

where k is the constant appearing in Eqs. (1-34) and /b (O') is the polarization 
tensor of the incident radiation. 
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The dimensional constant K can be determined by considering the special case 
of thermodynamic equilibrium. In this case the incident radiation (isotropic and 
unpolarized) at the resonant frequency v 0 is characterized by the polarization tensor 

ji = v l k B T s 



where we have used the classical expression for the Planck function. Substitution 
into Eq. (5.78) gives 



KA, = T / £ eyes 



K Vq k B T 



f[6 a p-(Q'-uJ(n'-ut)]dn' 



8ir K Vq k B T 
II c 2 



a/3 * 



(5.79) 



On the other hand, the amplitudes of the atomic oscillations in thermodynamic 
equilibrium are given by (see Eq. (3.35) for a similar derivation) 



A* A - 7TkBT 
^ “ 27 r% 2 m ^ ■ 

Comparison of expressions (5.79) and (5.80) leads to 

3 jtc 2 



I< 

~k 



167T 3 Uq m 



(5.80) 



(5.81) 



Using Eqs. (5.78) and (5.81), Eq. (5.75) becomes 
= 

OO 

■ 7 E C «i C K E C 'ak C Pl r kl (^) ^ d0 / F »* W > ( 5 - 82 ) 



3 7reQ 
2 me 



a/3 



where we have extracted the factor z/q from the integral since the Fourier transforms 
F a ( v) are substantially non-zero only for v ~ u 0 (see Eq. (3.33)). 

The frequency integral can be easily evaluated with the help of the residue the- 
orem 

OO 

1 (5.83) 






a — P)H 



H = 






2mc r y 



where 



(5.84) 
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The quantity H is proportional to the magnetic field intensity; numerically we have 

H = 0.879 B / 7 , (5.85) 



with B in G and 7 in 10' s 1 . 

Substitution of Eq. (5.83) into Eq. (5.82) leads to 

dQ' 

d 4 (fi) = a cl Y, T HM (fi, n';B) 4 4') — dfi , (5.86) 

kl 



where 



oy 1 = 



7re5 

me 



is the classical (frequency-integrated) cross section of the atomic oscillator, and T 
- the so-called scattering phase matrix for the polarization tensor - is defined by 



T ijM (n,n'-B) - c^c Pj c f ak c^ 

rvft 



1 



(5.87) 



This matrix depends on the geometry of the scattering event and on the magnitude 
and direction of the magnetic field vector; moreover, it depends on the choice of 
the unit vectors (e 1; e 2 ) and (e{, e! 2 ). Note that expression (5.87) is valid both 
for real unit vectors and, more generally, for complex unit vectors of the form of 
Eqs. (1.41). 

The following symmetry property can be proved directly from Eq. (5.87) 

T ijM {n^-BY=T jitlk {n,^'-B) . (5.88) 

If the direction of the magnetic field vector is reversed, one gets a sign inversion of 
the term in H in Eq. (5.87) (see Eqs. (3.23), (3.27), and (3.33)). This leads to the 
further property 

T ijM (n,fi'-,-B) =T lkJi (ft,fr,g). (5.89) 

In the case of zero magnetic field the expression for the matrix T reduces to a much 
simpler form. Summation over the indices a and /3 yields 

T ijM & 0 ) = T ijM { a, n r ) = | D ik D * jt , 

where the direction cosines D ik are given by 

D ik = e) • e'* . (5.90) 

Equation (5.86) can be transformed into an equivalent equation relating the 
Stokes parameters of the incident and scattered radiation. As noticed several 
times, the relation between Stokes parameters and polarization tensor depends 
on the choice of the unit vectors e) and e[. The most natural choice is to take, 
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both for the incident and for the scattered radiation, the reference direction unit 
vector and the associated unit vector defined in Sect. 1.6. In other words, we take 

e, = e a (n) e[ = e'(f2') 

e 2 = e b(^) e 2 = e b(^ ) ’ 

In this case the required relation is given by Eqs. (1.40), which imply that the 
frequency-integrated Stokes parameters dS' i (Il) scattered in the solid angle clO by 
an atomic oscillator are given by 

( Vi) nm d , 



with a i the Pauli spin matrices defined in Eq. (1.17). Using again Eqs. (1.40), we 
obtain 



_ ^ _ r ]Q' 

dS i (Q)=a d ^2R ij (Q,Q!-,B)S , j (Q!)—dn (i, j = 0, . . . ,3) , (5.91) 

j 

where the scattering matrix R for the Stokes parameters, sometimes called the 
phase matrix, is given by 



R iJ (n,n'-B) = 

= \ ( a i)lk( a j)mn 



klmn 



3 

4 






kin 



y 

mn / j 
a (3 



^ak^pl^am 



C\ 



0 n 



1 

1 + i (a — /3)H 



(5.92) 



In the case of zero magnetic field the matrix R reduces to the so-called Rayleigh 
phase matrix 

Rij(^’ ; 0) = RijiQ, 6) = £ ^ ( a i)lk( a j)mn ^km^ln I 

klmn 

where the direction cosines D- are defined in Eq. (5.90). 

Since the Stokes parameters are real, the scattering matrix must be real as well, 

R^n'-By = R ij (n,n , -B)- (5.93) 

this can be checked directly from Eq. (5.92) recalling that 

O i)nm = ^mn ■ 

Moreover, from Eq. (5.89) one gets 



(5.94) 
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incident 

radiation 




Fig. 5. 8. The Hanle effect is illustrated by this simple scattering experiment. For zero magnetic 
field the polarization of the scattered beam is perpendicular to the plane of scattering; when 
a magnetic field is present, the polarization is reduced and its direction is rotated through an 
angle a. 



In the particular case of zero magnetic field this equation becomes 

R ij (n,n') = R ji (n',n) , (5.95) 

which is known as the Helmoltz principle of reciprocity for a scattering event. 
Thus Eq. (5.94) represents the generalization of this principle to the case where a 
magnetic field is present. 

The equations that we have derived in this section describe the scattering of 
polarized radiation by an atomic system, schematized as a harmonic oscillator, in 
the presence of a magnetic field. In the limiting case of zero magnetic field the 
results here obtained reduce to the classical results of Rayleigh for the scattering 
of polarized radiation. 1 

In the presence of a magnetic field the laws of scattering are deeply modified and a 
new phenomenon known as Hanle effect - appears. Discovered in the laboratory 
in the early years of this century (Hanle, 1924), this effect usually produces a 
depolarization of the scattered radiation and a rotation of its plane of polarization 
(see Fig. 5.8). However, this holds only for particular geometries of the scattering 
event - probably the most used in laboratory experiments and the most important 
for astrophysical applications. For different geometries the effect of the magnetic 
field can be very different, and in fact the polarization of the scattered radiation 
can even be increased relative to the non-magnetic situation. Thus, in general 
terms, one can only state that the Hanle effect consists in a modification of the 
scattered radiation due to the presence of a magnetic field. These points will be 
further clarified in Sect. 5.9. 



It should be remarked that Rayleigh’s law of scattering can be applied to a broader class 
of phenomena and not only to the scattering by a harmonic oscillator - the simple case on which 
we have based our deduction. Thomson scattering by free electrons, or scattering by a dielectric 
sphere having a radius much smaller than the wavelength of the radiation, are just two examples 
where Rayleigh’s law can still be applied in full generality. The only difference with the case 
considered here is in the cross section <r cl appearing in Eqs. (5.86) and (5.91). 
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Fig. 5. 9. The most general geometry for the scattering event. The unit vector 1, lying in the plane 
perpendicular to O, is the reference direction unit vector e a (f2), while 2 is the associated unit 
vector e b (Q). The unit vectors 3 and 4 are and e b (Q'), respectively. 



Equation (5.92) describes the Hanle effect from the classical point of view; we 
will see in Chap. 10 how this equation is generalized in the quantum theory. 



5.8. The Scattering Phase Matrix in a Particular Case 

In the previous section we have obtained a general expression for the scattering 
phase matrix that can be applied to any geometrical configuration of the scattering 
event and of the magnetic field and to any choice of the reference directions for the 
Stokes parameters. For practical applications it is however useful to have explicit 
analytical expressions for the scattering phase matrix in particular geometrical 
situations. 

The most general geometry is illustrated in Fig. 5.9, where the three directions Cl, 
O', and B are specified by their polar angles and where the reference directions are 
defined by the angles 7 and yb In this geometry the scattering phase matrix will 
depend on 8 parameters, namely 9, 9 ' , 9 B , ( y — Xb)> (x ? — Xb)> 7) 7 7 ) an d and 
the calculation of one element will require the sum of 144 (= 3 2 x 2 4 ) terms. 
It is not worth here to develop in detail these calculations; the interested reader is 
referred to Landi Degl’Innocenti and Landi Degl’Innocenti (1988), where an ana- 
lytical expression for the phase matrix is derived under the assumption 7 = y' = 0 
(which means that the positive Q-directions for the incident and scattered beams 
lie in the meridian planes of Fig. 5.9). Being deduced from the quantum theory, 
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Fig. 5. 10. A particular geometry for the calculation of the scattering phase matrix. The polar 
angles of the directions Q and Q' are reckoned from the magnetic field direction. The meaning of 
the unit vectors 1, 2, 3, 4 is the same as in Fig. 5. 9. 

the expressions presented in the paper just quoted are in fact more general than 
those obtainable from Eq. (5.92); however, they reduce to the classical results with 
easy transformations. 

Some general properties of the scattering phase matrix relative to the geometrical 
configuration of Fig. 5.9 will be given in Sect. 5.12. Here we want to derive the 
explicit expression of this matrix for the much simpler geometry illustrated in 
Fig. 5.10. 

Now the magnetic field vector is parallel to the z-axis and the reference direction 
for the Stokes parameters lies in the meridian plane containing the magnetic field 
and the propagation direction. The various unit vectors are given by 

e a (H) = e 1 = cos 6 cos y i + cos^siny j — sin0 k 

e b (H) = e 2 = - sin y i + cosy j 

e'(fi') = e[ = cos0'cosy' ?+ cos O' sin y' j — sin#' k 

e b (H') = e' 2 = -siny' i + cosy' j 

u 0 = k 

u ±i = ^ (T? + if) , 

and the direction cosines can be easily evaluated 
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C 1X = — -= cos 8 e 
v 2 

C 0 1 = — sin 8 

C i , = -4= cos 9 e' 
” y/2 



( < — J_ p 1X 
Cl2 ~V2 

C 02 = 0 
C — p lx 

~ 12 ~V2 



with analogous expressions for C' ai . 

From these expressions the elements of the scattering phase matrix for the polar- 
ization tensor, given by Eq. (5.87), can be evaluated via some heavy algebra that 
is left to the reader as an exercise. The results are the following 

T n> n = | 1 - /r 2 - /z' 2 + | /z 2 /z' 2 + 2 W /Vl - £i 2 cv + \ A*V 2 C 2 



^11,12 — ^11,21 — 9 M V 1 M 2 V 1 M /2 9 M Sr 



ri2.ii = r 21 ,u = | Si + ^ w' 2 s 5 



T nl oo = 



T — 

1 OO 1 1 



V-pM 2 C 2 



m' 2 - pM ,2 c 5 



^12,12 — r 21 21 — 



i /z/z' + \/l - /z 2 \/l - Ad 2 Cl + ^ /z/z' C 2 



3 1 



^12, 21 — T 21,12 — p p MM + p MM C. 



iz.zi 2 ^ 2 r r 

^12, 22 = ^21 22 = p [ — p 



2 2 — 

3 r i ^ 

T 22,21 = ^22,12 = 2 2 M S 2 

T 22,22 = 2 2^2 ’ 



(5.96) 



where 



/z = cos ( 



/z' = cos 6 1 ' 



C 1 = cos cty cos (ay + X 7 — x) Si = cos ay sin (ay + X 7 ~ x) 

C 2 = cos a 2 cos(a 2 + 2 (x' - x)) S 2 = cos a 2 sin(a 2 + 2 (x' - x)) 



(5.97) 



and where the angles a 1 and a 2 , implicitly defined by 



tan a 1 = H , 



tan a 2 = 2 H , 



(5.98) 
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are particularly useful to express the quantities (1 ± i H) 1 and (1 ± 2i H) 1 in the 
form 

a I • TT\ 1 T 2 

± iH ) = cos e 1 = cos ay =F i sin a 1 cos a 1 

(1 ± 2if7) _1 = cosa 2 e T1Q2 = cos 2 a 2 =F i sina 2 cosa 2 . 

Note that the intensity of the magnetic field is contained only in the quantities C x , 
C 2 , S 1 , S 2 . In particular, for zero magnetic field one gets 

Ci = cos(x' - X) Si = sin(x' - x) 

C 2 = cos 2(x' - x) S 2 = sin 2(x' - x) , 

and for strong magnetic field ( H 1) 

c 1 = s 1 = c 2 = s 2 = o, 

so that the expressions of the matrix elements are considerably simplified. 

Finally, substituting Eqs. (5.96) into Eq. (5.92) and using the explicit expressions 
of the Pauli spin matrices (Eqs. (1.17)), one obtains for the scattering phase matrix 
of the Stokes parameters the following expressions 

R 00 (n : S ) = | ( 3 — m 2 — M ' 2 + 3/iV 2 ) + | wVl^M 2 \A-M' 2 Ci 

+ ^(i-m 2 )(i-a/ 2 )c 2 

R 01 (6, ft; B ) = i (1 - 3 M 2 ) (1 - A / 2 ) + | w/ x/W 2 x/T - M 72 Ci 

-^(i-m 2 )(i + m' 2 )c 2 

R 02 (n, ft; B) = - ^px/T - Jr Si + 1 (1 - s 2 

7? 03 (i1,fI , ;B)=0 

i? 10 (fl, 6'; B ) = | (1 - M 2 ) (1 - 3a/ 2 ) + | /Wl-M 2 x/l-M' 2 C, 

-^(i + m 2 )(i-a/ 2 )c 2 

i? n (fl, 6'; S ) = | (1 — m 2 ) (1 - A/ 2 ) + | /Vx/W 2 x/W 2 C x 
+ g (! + f* 2 ) (l + l^ 2 ) C 2 

Ri 2 (n : Q';B) = — | /rx/T - M 2 x/W 72 ^ (l + mV S 2 

Ri 3 (Ct,n';B) =0 
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R 20 (n, d' ; b ) = - mVi - V 2 V - V 5i - - m(1 - m' 2 ) ^2 



i? 21 (0, d' ; b ) = - mV 1 - V - V Si + 4 m(i + V) s 2 



i? 22 (d, d' ; b ) = - x/w 2 v + - ^ c 2 

R 23 (n,n';B) =0 
B 30 (d,d' ; B)=o 
i? 31 (d,d , ;S)=0 
i? 32 (d,d , ;S)=o 



fl 33 (fi ) fi';B) = - /1/1 ' + -VVVlVC 1 . 



(5.99) 



These formulae show that the V Stokes parameter ( S 3 ) is totally decoupled from 
the other Stokes parameters in a scattering event. 

In the limiting case of strong magnetic field one gets 



/§ + 1(1-3m 2 )(1-3m' 2 ) (1-3m 2 )(1-m' 2 ) 0 



/?(d,d';5 



o=; 



o 



V 



(l-M 2 )(l-3// 2 ) 

0 

0 



\ 



3(1-m 2 )(1-V) 0 0 

0 0 0 
0 0 4mm' ) 



which shows that the U Stokes parameter of the scattered radiation is zero irre- 
spective of the direction and polarization of the incoming beam. If, in addition, the 
incoming beam is unpolarized, the fractional polarization of the scattered radiation 
is found to be 

Q (1-M 2 )(1-3m' 2 ) 

I 3 — m 2 — M /2 + 3m 2 m' 2 

thus it is positive, negative, or zero according as (m' 2 — 1/3) is negative, positive, 
or zero. Defining the Van Vleck angle 0 V according to the formula 



which implies 



one obtains 



cos 0 V = 1 / Vs , 
0 V = 54°. 74 , 



(5.100) 

(5.101) 



y > 0 (linear polarization parallel to B) 

Q 



for 6* v < O' < n — 6- 



v 



< 0 (linear polarization perpendicular to B) for 0 < 9' < 0 V 

Or 7T — 0-sj < 9' < 7T . 
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Fig. 5. 11. Geometry for a 90° scattering. The magnetic field lies in the plane perpendicular to 
the incident beam and makes an angle (3 with the direction of the scattered beam. The reference 
direction for the Stokes parameters of the scattered radiation is perpendicular to the scattering 
plane. 

In other words, for an unpolarized incoming beam and in the limit of large magnetic 
field ( H 1), the linear polarization of the scattered beam is either parallel 

or perpendicular to the magnetic field according to the direction of the incident 
radiation. 

Another consequence that can be easily deduced from Eqs. (5.99) concerns the 
particular case of an incident beam parallel (or antiparallel) to the magnetic field 
vector. Setting jJ = ± 1 in Eqs. (5.99) we see that the matrix elements of the 
form R i0 ( i = 0, . . . , 3) are independent of the magnetic field. Therefore, if the 
incident beam is unpolarized the scattered radiation is completely insensitive to 
the magnetic field. 



5.9. Some Illustrations of the Hanle Effect 



To understand in further detail the role of the magnetic field in resonance scattering, 
it is convenient to apply the results obtained in the previous section to some specific 
geometrical configurations of the scattering process. 

Consider first the configuration of Fig. 5.11, where we want to investigate the 
polarization of the radiation scattered at 90° from an unpolarized incident beam. 
The scattering event takes place in the presence of a magnetic field which lies in 
the plane perpendicular to the incident beam. 

Since the incident beam is unpolarized, the only elements of the scattering phase 
matrix that must be computed are those of the form R i0 (i = 0, . . . , 3). Taking as 
reference direction the perpendicular to the scattering plane (see Fig. 5.11), these 
matrix elements can be deduced from Eqs. (5.99) by the substitutions 

7T 

A< = cos/3, n' = 0, X~X=2' 



This leads to the expressions 



R 



oo — 




cos 2 /3 — sin 2 /3 cos 2 a 2 ) 
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Fig. 5. 12. Polarization diagram (or Hanle diagram) relative to the scattering process illustrated in 
Fig. 5. 11. Full lines correspond to (3 = const., while broken lines correspond to constant magnetic 
field strength, parameterized through the quantity a 2 (see Eqs.(5.84) and (5.98)). 

3 

R 10 = o (sin 2 /? + (1 + cos 2 /?) cos 2 a 2 ) 



i?20 = cos P s i R a 2 cos a 2 

^30 = 0 ) 

so that the fractional linear polarization of the scattered beam is given by 

Q sin 2 /? + (1 + cos 2 /?) cos 2 a 2 
I 3 — cos 2 /? — sin 2 /? cos 2 a 2 

U 2 cos /? sin a 2 cos a 2 /X ino\ 

T = « Ya • 2 n 2 ' (5.102) 

1 3 — cos 2 /? — sin /? cos 2 a 2 

The results now deduced can be conveniently plotted in the polarization diagram 
(or Hanle diagram ) shown in Fig. 5.12. The solid lines are isoazimuth curves 
(/? = const.), while the broken lines are isostrength curves (a 2 = const.). For 
any assigned couple (/?, a 2 ) the diagram gives the polarization Q/I, U/I of the 
scattered beam. Note that the values of /? can be confined in the range [0, 7r] since 
Eqs. (5.102) are invariant under the substitution /? — > — /?. 

There are several characteristics of Eqs. (5.102) that it is worthwhile to discuss 
in some detail. First of all, in the case of zero magnetic field (a 2 = 0) one gets 

Q U 

— = 1 — = 0 

I ’ I ’ 



(5.103) 
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thus the scattered beam is totally polarized perpendicularly to the scattering plane. 
On the other hand, for very strong magnetic field (a 2 = 7t/2) one obtains 

Q _ sin 2 /? ——o 

I 3 — cos 2 (3 ’ I 



so that the scattered beam is again polarized perpendicularly to the scattering 
plane (consistently with our former discussion on the Van Vleck angle); however, 
the fractional linear polarization varies now between 0 and 1/3 according to the 
value of (3. 

For intermediate values of the magnetic field, the U Stokes parameter is in general 
non-zero, which means that the direction of linear polarization is rotated from the 
zero-field direction. The maximum rotation for a given magnetic field strength 
occurs for (3 = 0 or n. In these cases we have 



Q 2 

— = cos a, , 
I 2 



U 

1 



= ± sin a 2 cos 



where the plus sign for U/I corresponds to (3 = 0 and the minus sign to (3 = n. In 
both cases the total linear polarization is given by 



Pl = \Z(f) 2+ (f) = cos “> = vrhm • (5 - 104) 

while the position angle a 0 (see Eqs. (1-46)) is 

a 0 = ± — a 2 = ± — arctan2IF (5.105) 

with the plus sign for (3 = 0 and the minus sign for (3 = n. This formula shows 
that a magnetic field with the same direction as the scattered radiation produces 
a counterclockwise rotation (for an observer looking at the scattering point) of the 
plane of linear polarization; conversely, the rotation is clockwise if the magnetic 
field is in the opposite direction. 

Equations (5.104) and (5.105) are often used to discuss the role of the Hanle 
effect in scattering polarization (see e.g. Mitchell and Zemansky, 1934). They are 
however of limited use because they refer to an extremely particular case. 

In the scattering configuration just considered the magnetic field produces, be- 
sides a rotation of the polarization direction, a decrease of the polarization degree 
(depolarization); but this is not always the case. A counter example is provided, 
for instance, by forward scattering. 

The relevant geometry is now illustrated in Fig. 5.13, where we consider the effect 
of a magnetic field on the forward scattering of an unpolarized radiation beam. The 
polarization of the scattered beam can again be deduced from Eqs. (5.99) by the 
substitutions 

l_ L = p! = cos/3 , X = X'- 
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Fig. 5. 13. Forward scattering. The reference direction for the Stokes parameters lies in the plane 
containing the magnetic field and the direction Q. 



We 


obtain 






“00 


= 1 + ^ (3 cos 2 (3 - 
8 


28 3 

- 1 ) + - sin 2 /? cos 2 /? cos 2 a 1 + - sin 4 /? cos 2 a 2 
2 8 




*10 


3 

= - sin 2 /? [l — 3 cos 2 /? + 4 cos 2 (3 cos 2 aq — (1 + cos 2 /?) cos 2 a 2 1 
8 




*20 


3 

= - sin 2 /? cos/? [2 


sin aq cos a 1 — sin a 2 cos a 2 ] 




*30 


= 0 . 




For 


zero 


magnetic field (aq 


= a 2 = 0 ) one gets 



(5.106) 



so that the scattered beam is unpolarized as the incident beam. For strong magnetic 
field (aq = a 2 = n/2) one obtains 

Q (1 — cos 2 /?)(l — 3 cos 2 /?) U 
I 3 — 2 cos 2 (3 + 3 cos 4 /? ’ I 



or, in other words, a linear polarization which is parallel or perpendicular - accord- 
ing to the sign of the quantity (1 — 3 cos 2 /?) - to the plane containing the magnetic 
field and the propagation direction. For intermediate values of the magnetic field 
the U Stokes parameter is non-zero, which means a rotation of the plane of linear 
polarization. The rotation angle is a complicated function of /? and H , as apparent 
from Eqs. (5.106). 

This example shows that in some scattering configurations the magnetic field is 
able to produce a definite amount of linear polarization in a radiation beam which 
would be unpolarized if the magnetic field were not present. 
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5.10. The Scattering Phase Matrix Expressed in Terms 
of Rotation Matrices 

The equations that we have derived in the preceding sections involve the evaluation 
of several direction cosines relating the unit vectors u a , relative to the magnetic 
field, to the unit vectors e)(fi) needed for the definition of the Stokes parameters. 
These direction cosines and, in particular, their bilinear combinations can be conve- 
niently expressed in terms of ordinary rotation matrices. This leads, in most cases, 
to much more compact and handy expressions for the relevant physical quantities, 
like the scattering phase matrix derived in Sect. 5.7. 

The introduction of rotation matrices requires, however, some remarks about 
the spherical components of ordinary vectors, an argument that we have already 
treated in Sect. 2.7 and that we are going to develop in further detail here. 

Given an arbitrary vector v, its spherical components v q are defined by (see 
Eqs. (2.82)) 

'■■1 = v y ) 

v o = V z 

Vi = -^K + i O’ (5.107) 

where v x , v y , v z are the Cartesian components in a right-handed coordinate system. 
Since in the following we will often deal with vectors of the form 

v = a + i b (5.108) 

with a and b real, we want to establish some properties of the spherical components 
of such vectors. 

Taking the complex conjugate of Eqs. (5.107), with v given by Eq. (5.108), we 
have 

O-l)* = [K + b y) - i (K - a y) ] 

Oo )* = a z~ ib z 

(O* = ~-j= [(a x -b y )- i (■ b x + a y )\ . (5.109) 

On the other hand, taking the complex conjugate of Eq. (5.108), 

v* = a — ib , 

we have for the spherical components of the vector v * 

[K ~b y )~ i [b x + a y )] 

(Oo = a z ~ [ b z 

(v*)i = [(a x + b y )~ i (b x - a y )] . 



(5.110) 
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Comparison of Eqs. (5.110) and (5.109) shows that 

K), = (-i) a (f_,r , (5-in) 

a formula that could also be deduced by application of Eq. (2.85) to ordinary vectors 
(tensors of rank 1). Thus the complex conjugates of the spherical components of 
the vector v do not form an irreducible spherical tensor; nevertheless they are 
connected with the spherical components of the vector v * by the simple relations 

( 5 - 111 ). 

From now on we will use for simplicity the symbol v* (without parentheses) to 
denote the complex conjugate of the spherical component of the vector v, 

v* q = (v q )* ■ 

Consider now the direction cosines C ai defined in the former sections. Since the 
scalar product of two vectors v and w can be written in the form (see Eqs. (2.83)) 

v-w = ^2(-l) q Vq w_ q , (5.112) 

we have 

c ai = U a • e* = £ (-iy (u a ) q (e*)_ q = Y, M q (e,)* , (5.113) 

Q Q 

and for a bilinear product of two direction cosines 

qq' 

This expression suggests the introduction of a tensor - the dyadic product of two 
irreducible spherical tensors of rank 1 - which plays a very important role in the 
description of polarization phenomena. Indeed, as we will see in the following, this 
tensor - and the analogous ones that will be derived from it - appears as a natural 
ingredient of almost any mathematical expression relevant to polarized radiation. 
For this reason it deserves a careful definition. 

In the right-handed reference system ( xyz ) of Fig. 5.14 we consider a particular 
direction characterized by the real unit vector c. In the plane perpendicular to c 
we introduce two real unit vectors, a and b , such that a, 6, and c form, in this 
order, a right-handed coordinate system. The orientation of (a, b , c) relative to 
(xyz) is specified by the angles 0, an d 7 defined in Fig. 5.14, with 

0 < 9 < 7i , 0 <x<27t, 0<7<27t. 

From the unit vectors a, 6, c we then define the unit vectors c_ 1 , c 0 , c +1 via the 
equations 

c_i = (a +ib) 
c 0 = c 

c +1 = -j=(-a + \b) , 



(5.114) 
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Fig. 5. 14. Geometry for the definition of the tensor £ , (a, /3, c). 



and, finally, we define the reducible spherical tensor £ qq , relative to the triplet 
(a, b, c) as 1 

£ qq '((*,0,c) = (cj g (cp)*, (a, P = ±1; q,q' = 0,±1) . (5.115) 

The explicit expression for £ qq , (a,/3, c) can be found by the following argument. 

In the reference system ( a,b,c ) we have, from Eqs. (5.107) applied to the unit 
vectors defined in Eqs. (5.114) 

(^o)g ^ aq > ^0q f ' 

Since (c a ) q is an irreducible spherical tensor of rank 1, its expression in the reference 
system ( xyz ) of Fig. 5.14 can be obtained by application of the transformation law 
of irreducible spherical tensors under rotations. From Eq. (2.78) we get 

(cj , = E S « P D U R ) = V U R ) ( 5 - 116 ) 

p 

and, similarly 

M * q ,=vl q ,{Ry , 

The quantity S qq/ is neither an irreducible spherical tensor nor the dyadic product of 
two irreducible spherical tensors. The denomination ‘reducible spherical tensor’ is due to the fact 
that it can be reduced to the linear combination of irreducible spherical tensors (see Eq. (5.125)). 
Note also that S qq , depends not only on the direction of the unit vector c (specified by the angles 
6 and y), but also on the direction (specified by the angle 7 ) of the unit vector a in the plane 
perpendicular to c. To shorten notations, we keep only c as an explicit argument of the tensor. 
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where R = (—7, — 0, — x) is the rotation bringing the system (a, b, c) into the system 
{xyz). Thus the expression for £ qq ,{a, f3,c) in the reference system ( xyz ) is 

£ qq ,(a,(3,c) = Vl q (R) Vl q ,{RY . (5.117) 

Let’s now go back to the scattering phase matrix Tappearing in Eqs. (5.86) and 
(5.87), and consider the case where the polarization unit vectors (e 1 , e 2 ) relative to 
the scattered radiation and the analogous vectors (e-^ej) relative to the incident 
radiation are defined by the expressions (note the similarity with Eqs. (5.114)) 



_ _ 1 

?1 - 72 L' 
_ _ 1 
22 ^7^ L 

J. 

1 V2 L 

_ 2 
V2 L 



e a (fi) + ie b (fi)J = e.^fl) 

~ e a(^) + * e b(^) = e +l(^) 

%(&) +!%{&)] 

-%(&) + i? b ({i'j\ =Z' +1 {&) 



(5.118) 



where e a (f2), e b (Q), e' a (Yl'), e b (f2 / ) are the reference direction unit vectors and the 
associated unit vectors defined as in Fig. 5.9. 

With this choice Eq. (5.86) can be rewritten as 



dV(tt) 



CT cl 



An' 

I' pa {Q.') — dO 



where y, v 1 p 1 a = ±1, with the elements of the matrix T given by (see Eq. (5.113)) 

= § E E Ml ( e , W (u a ) q „ )*„ ( U/3 )y (e' CT (^))^ 



a/3 qq’q"q" 



1 



1 + i (a — /3)i7 



(a,/3 = 0, ±1) . (5.119) 



This formula can be evaluated in any reference system. If we choose the reference 
system (u r ,u s ,u 0 ) used in Sects. 3.2 and 5.3 we simply have 



( U oc)q = S a q > C U ())q' = 5 0q' ■ (5.120) 

Thus Eq. (5.119) can be written, with the help of Eqs. (5.115) and (5.117), in the 
form 



T 

liv, per 







£ 99 '0,^) £ q ’q{v, P , ^') 
l+i(q-q')H 



3 v v lA R Y Kq( R T 

2 ^ 1 + i (q-q')H 

qq 



(5.121) 
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where R is the rotation bringing the system (e a (Q) , e b (Q) , Q) into the system 
(u r , u s , u 0 ) and, similarly, R' is the rotation bringing the system (e' ($2'), e^fi'), U) 
into the system (u r ,u s ,u 0 ). 

In the general geometrical configuration of Fig. 5.9, R and R' can be expressed 
in the most convenient way as the result of two consecutive rotations, 

R= (-7,-0,-x) x (xb>0b><x) 

R' = (-y, - x ') X (xb^b, «) , ( 5 - 122 ) 

where a is an arbitrary angle whose actual value is unimportant for the evalua- 
tion of the quantity T pv pa and that can be set equal to zero. In the geometrical 
configuration of Fig. 5.10, R and R' are simply given by 

R=(0,-9,-x) , R' = ( 0 , -e\ - x ') - ( 5 . 123 ) 

As shown in Sect. 2.7, the dyadic product of two irreducible spherical tensors of 
rank k and k' can be used to construct an irreducible spherical tensor of rank I\ . 
Therefore, recalling Eq. (5.111), one can obtain an irreducible spherical tensor by 
taking appropriate linear combinations of the quantities £ ,(a, 0,c) defined in 
Eq. (5.115). According to Eq. (2.79) such combinations are given by 1 

£« (a , /?,<?) = £ (-1) 1+ V3(2 K + 1) Q R Q ]£ qq ,(a,(3,c) , (5.124) 

with the inverse transformation 

£ qq , (a, /?, c) = £ (-1 ( \ Kq)£q («. /?,*)■ (5.125) 

In terms of rotation matrices, one gets from Eq. (5.117) 

£q («, /?,<?) = £ (~1) 1+ V3(2 K + 1) Q ^Av^WV^iR)* , 

qq' V 2 

which, using Eqs. (2.77), (2.23a), (2.73) and the fact that a and f3 can only take 
the values ±1, can be written in the form 

£$(a,0,e) = y/3(2K+l)(^ K q ^JV« q {R) (a,/? = ± 1) . (5.126) 

1 The linear combinations of Eq. (2.79) have been multiplied by the factor f(K) = — ^ 3(2K+1 ) . 
This slightly simplifies some expressions that will be deduced in the following. 
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Substitution of Eq. (5.125) into Eq. (5.121) leads to the expression of the scat- 
tering phase matrix T in terms of irreducible spherical tensors 



W(M';^) = 

= \ E (-!) q £ q fa «) S -Qfa P . Y-y7 



= |E(- 1 ) q ( 2A ^+ 1 ) 



11 K \ f 11 A' 



p —v —Q'J \<j —p -Q 






(5.127) 



where R and R' are the rotations given in Eqs. (5.122) or (5.123). 

To deduce the scattering phase matrix R for the Stokes parameters we need the 
relation between the Stokes parameters and the polarization tensor corresponding 
to the basis of unit vectors (5.118). This relation can be obtained from Eqs. (1-42) 
by the substitution 6 = 57t/4, <j> = jt/2 . We have 



/=/+++/__ Q= -(/+_ + J_ + ) 

y=i ++ -i- u= -i (/ + _-/_ + ), 

where we have shortened the notations writing I , , instead of / +1+1 and so on. 
The inverse transformations are 

I ++ = \(I + V) i + _= l -(-Q + w) 

I- = \{I-V) I_ + = \(-Q-W). 

If we take an appropriate representation for the Pauli spin matrices (which differs 
from that of Eqs. (1.17)) we can write these transformations in a more compact 
form. Defining 




with 

o’o + i E & k (b j,k = 1,2, 3) 

k 



and 



Tv(a k a l ) = 25 kl (k,l = 0,1, 2, 3), 
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and labelling the rows and columns of the matrices in the order (— , +) (so that, for 
instance, (dj, = (d i ) 2 1 )> we have 1 

1 3 

I cc0 = 2 S 3 (^)a/ 3 («. /3 = ±0 

3=0 

Sj = E (^)a/3 J /3a O' = 0, 1, 2, 3) . (5.129) 

a/3 



Since these transformations are identical to (1-40) provided the matrices cr i are 
exchanged with cr i , the scattering phase matrix for the Stokes parameters can be 
easily obtained along the same lines of Sect. 5.7. The result is 

lMlsp<J=± 1 



or, using Eq. (5.121) 

R ij (n,n'-B) = 3j2 

qq' 



T qq ,(i,n)T qlq (j, n’) 

l + i(q-q')H 



(5.130) 



where we have defined the tensor T ,(i, O) as 

a/3=±l 



From the tensor T , it is possible to construct - similarly to what done for 
£ , - an irreducible spherical tensor, Tq , that will play an important role in the 
continuation of this book 



T£(z, ) = E (-!) 1+ V3(2^ + 1) ( \ } q , _^)r g9 ,(i, ft) 

qq' ' 2 

=E £ } 5 ) 

qq' c*/3=±l V 7 

X \ (di) a/3 ^ 9 /(/3,a,0) 

= E \ £qW,01,^) • 

a(3=± 1 



(5.132) 



Equations (5.129) have the same form as Eqs. (1.40). Note, however, that they involve the 
polarization tensor defined on the basis of the unit vectors (5.118), while the polarization tensor 
in Eqs. (1.40) is defined on the basis of the reference direction unit vector and the associated unit 
vector. 
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Inversion of the first line of Eq. (5.132) gives 



t ot ,M) = £(- i) 

KQ 






which can be substituted into Eq. (5.130) to obtain the expression of the phase 
matrix R in terms of the irreducible spherical tensor Tq 



Rijiti, = £ (-1) Q fi) T- Q U, no 

I<Q 



1 

1 + i QH ' 



(5.133) 



Using Eq. (5.126) the tensor Tq can be explicitly written in terms of rotation 
matrices 

T I <{i,n) = Y J tf(i)V% Q {R), (5.134) 

p 

where 




(5.135) 



From this equation, using some properties of the rotation matrices (Eqs. (2.71) and 
(2.73)) and noticing that tp(i)* = Eq. (5.133) can be rewritten in the form 



Raft, a';B)= £ *p(0 tp'UT £ v$ Q {R) vgA#- 1 ) 



KPP' 



1 

1 + i QH 



(5.136) 



which is our final equation expressing the phase matrix R in terms of rotation 
matrices. In this equation we can isolate the contributions of the various Tf- values 
(sometimes referred to as the multipole components of the phase matrix) 






(5.137) 



K 



with 



r: 



(K) 



(U, i M) = £ (-1) Q Tq (i, n) T« Q {j, Q,') — 
Q 



QH 



= £ #(0 tp'UT £ Rp Q (R) Rqp'(r'~ 1 ) 1+iQH ■ ( 5 - 138 ) 



PP , 



The tensors introduced in this section satisfy several properties, which are col- 
lected in the following section together with a number of tables that are particularly 
useful for applications. With the help of such tables we can easily get back expres- 
sions (5.99) which give the scattering phase matrix in the reference system with 
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the 2 - axis parallel to the magnetic field (see Fig. 5.10). In terms of multipole 
components we get 



R (0> (Q,Q';B) = 




R :i) (n,n';B) = ^ ( w z + Twz 2 c x ) 

/ (1 — 3/x 2 )(l — 3 /lx / 2 ) (1 — 3/r 2 )(3 — 3// 2 ) 0 0\ 




1 



R m {( %&-,$) = - 



(3 — 3/z 2 )(l — 3// 2 ) 
0 
0 



+ 2 x/ 1 - M 2 x/ 1 - /Z : 



+ - 



,/ 2 \ 



(3 - 3/r 


! )(3 — 3/x' 2 ) 0 


0 




0 


0 


0 




0 


0 


0/ 




m>! c x 


~/iS 1 


°\ 


/V c x 


w'Ci 


-/iS x 


0 


v!Sx 


I*’S 1 


Cl 


0 


V o 


0 


0 


0 / 


+ A Z 2 )C 2 


2(1 - /F 


)v'S 2 


°\ 



-2/x(l - /Z 2 )S 2 
0 



2/r(l + h' 2 )S 2 
0 



— 2(1 + ^ 2 )fj,'S 2 0 
4fj,fi'C 2 0 

0 0/ 



(5.139) 



where the notations are the same as in Eqs. (5.97). Adding the contributions of 
the three matrices R (0) , f? (1) , and R (2 \ Eqs. (5.99) are easily obtained. 



5.11. Spherical Tensors for Polarimetry 



The various tensors defined in the former section are particularly suitable for the 
description of polarization phenomena in spectral lines. As they will be heavily 
used in the following of this book, their properties deserve a thorough discussion. 
At the same time, we will take the opportunity of introducing some new tensors 
that will also be useful later. 

Starting with the reducible tensors, we have defined 

= ( e «(^)) ? ( e /?(^))q/ (q,q'= 0, ±1; a,P = ± 1) (5.140) 



with 



e ±i(^) 



1 

71 



“F e a (^) + i e b(^) > 



(5.141) 
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and we have obtained, in terms of rotation matrices (see Eq. (5.117)) 



£ qql (a,P,n)=V 1 aq (R) T>p q ,(R)* , 



(5.142) 



where 



R= (-7,-0,-x) 



is the rotation bringing the coordinate system (e 0 (fi), e b (fi), $ 1 ) into the system 
(. xyz ) (refer to Fig. 5.14 with (a,b,c) replaced by (e 0 ($2), e b (U), $1)). 

In terms of reduced rotation matrices (see Eq. (2.68)) one gets, with the help of 
Eqs. (2.70) 

c ( a AC i [(9-?')x+ («-/3 )t] ,1 ™ ,1 (n-, 

£ qql (a,(3,n) = e d qa {9) d ql0 {9) . 

From Eq. (5.142), using Eqs. (2.72), one obtains 

E v lA R y > 



and by application of the Weyl theorem (Eq. (2.76)) 



E f^ e *A a ' a &) = \& qq > ■ 

Li ** 



(5.143) 



Again from Eqs. (2.72) one gets the relation 






(5.144) 



and directly from the definition (5.140) 



£ = £ , (/3, a, . 



(5.145) 



The explicit expression for the tensor £ qq , (a, /3, f2) as a function of y, 0, and 7 is 
given in Table 5.2. 

The other reducible tensor that we have introduced is T ), defined by (see 
Eq. (5.131)) 

a/3=±l f«-U,...,d) . 



Taking the complex conjugate, we have from Eq. (5.145) 



VM)* = E 



a/3=± 1 



(5.146) 
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TABLE 5.2 

Explicit expression for the tensor £ qq ,{a,f3, f2). The remaining components can be obtained from 
the relation £ q , q (a, /3,fi) = £ / (/3, a, f2)*. 

£_ 1 _i( — , f 2 ) = | (l + cos 0) 2 
= \ sin 2 6 

£ n(-’ - ’^) = T (1 — cos0) 2 

£_io(- - S) = “ 575 sin0 (1 + cos0) e“' x 

£_n(-,-,^) = \ sin 2 0 e“ 2lx 

£q i ( ’ i = “575 sin0 (1 " cos^) e -1 * 

£—1 _1 ( — r> + , A2) 

£ oo( - ’+> n ) 

£ 1 x (— , +, f2) 

i o( — ’ 4- ) Al) 

£-1 l( _ 1 +! 

^0 l( — > +> 

£_1 _l(+) — 1 fi) 

£ oo(+’~> a ) 

l( + ) — ! SA) 

£_i — 5 

£_i i(+j — i £2) 

£oi(+> - ’ fi ) 



£_1 _l( + ! +> 


= \ (1-COS0) 2 


^Oo( + ’ +’ 


= | sin 2 0 


i(+, +> £2) 


= | (1 + cos 0) 2 


£_1 0 (+! +! ^2) 


= ^A_ s i n 0 ( 1 — cos 0 ) e _lx 


£_i 1 (+, +, 12) 


= A sin 2 0 e~ 2ix 


£ 0 l( + > +! 


= jA_ s i n 0 ( 1 + cos 0 ) e _lx 



= 4 e -2 ' 7 sin 2 0 
= — 4 e -2 ' 7 sin 2 0 
= 1 e -2 ' 7 sin 2 0 

= ^Aj e -2 ' 7 sin0 (1 + cos 0) e _lx 
= 4 e -2 ' 7 (1 + cos0) 2 e _2lx 
= — -^A= e~ 2 ' 7 sin 0(1 + cos 0) e _lx 

= \ e 2i7 sin 2 0 
= -i e 2i7 sin 2 0 
= 4 e 217 sin 2 0 

= — = e 2 ' 7 sin 0(1 — cos 0) e _lx 

= 4 e 217 (1 — cos 0) 2 e _2lx 
= — e 217 sin0 (1 — cos0) e _lx 
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and since the matrices cr i obey the conjugation property 



we get 



(^)q :/ 3 i) (3a ’ 


(5.147) 


T qq ,(i,fiy = r q , q (i,fi) . 


(5.148) 



Another property can be obtained from Eq. (5.144) using the trace properties of 
the matrices &, 



Y T qq( h = Y \ = \ Tr ( & i) = S i0 ■ (5.149) 

q a=±l 

The explicit expression for the tensor T qq ,(i, Cl) is given in Table 5.3. 

Using the tensors £ qq , (a, /3, (l) and T qq , (i, Q) we can construct a further reducible 
tensor, T qq ,(y, Cl), which describes the polarization properties of the radiation field 
propagating along the direction Cl. If I a p{v, Cl) is the polarization tensor (defined on 
the basis of the unit vectors e ± 1 (U) of Eq. (5.141)) and Sj(zA Q) is the Stokes vector, 
both relative to the frequency v and the direction Cl, we define the tensor I , (y, Cl) 
according to either of the equivalent relations (see Eqs. (5.129) and (5.146)) 

I qq '(v^)= £ qq '( a >P>ti) I p a { v &) 

a(3=± 1 
3 

= YTqq'(iA S i(v,ti) ■ (5.150) 

i = 0 

This tensor satisfies the conjugation property (easily deduced from Eq. (5.148)) 

l qql (v,Ciy=l q , q (v,Cl), (5.151) 

and the trace property (consequence of Eq. (5.149)) 

Y = S 0 M) = I(u,n) . (5.152) 

The explicit expression for the tensor T , (v, Cl) is given in Table 5.4. It is important 
to remark that the linear polarization Stokes parameters, Q and U, enter the 
expressions of Table 5.4 only via the linear combinations 

Q = cos 2q Q — sin 2q U , U = cos 2y U + sin 2 7 Q . 

These combinations are independent of the angle 7 which defines the reference 
direction for the Stokes parameters Q and U. 
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TABLE 5.3 

Explicit expression for the tensor T qq ,(i,Cl). The remaining components can be obtained from 
the relation 7^, (i, S2) = T ,(i, Cl)* . 



i_i(o,n) 


= 1 (1 + COS 2 0) 


7 oo(°> 


= \ sin 2 0 




= \ (1 + cos 2 6>) 


^~_i o (O’ 


= — ——7= sin # cos 0 e~ lx 
2 V 2 


TLi i(o, 


= i sin 2 0 e _2lx 


T 0 i(°,Q) 


= —^-r= sin 0 cos 6 e _lx 
2V2 


T_i_i(l,S) 


= — i cos 27 sin 2 0 


^ool 1 ’ 


= — cos 27 sin 2 # 


^iiM) 


= — i cos 27 sin 2 0 




= — (cos 27 cos # — i sin 27) sin # e — 1X 


TLn(i^) 


= — i [cos 27 (1 + cos 2 0) — 2i sin 27 cos d] e~ 2lx 


T 0 i(LQ) 


= (cos 27 cos 0 — i sin 27) sin# e — 1X 




= i sin 27 sin 2 0 


Tqq( 2 , £2) 


= — i sin 27 sin 2 0 


i(2, ^2) 


= 7 sin 27 sin 2 0 


siioM) 


= ( s * n cos ^ i cos 27) sin # e — 1X 


TLi i(2j 


= j [sin 27 (1 + cos 2 0) + 2i cos 27 cos e _2lx 


^0 l(^j 


= — (sin 27 cos # + i cos 27) sin 6 e~ lx 


^~-i _i( 3 j 


= — ^ cos # 


7 ^ 0 ( 3 , fi) 


= 0 


i( 3 , £2) 


= ^ cos# 


o( 3 ? 


= j>= sin 0 e -ix 
2V2 


T_i 1 (3, £2) 


= 0 


7q 1 (3, £2) 


= 7 = sin# e _lx 

2^2 
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TABLE 5.4 

Explicit expression for the tensor X qq , (is,Q). The remaining components can be obtained from 
the relation X , (is, £2) = Z qq , (is, f2)*. 

T_ 1 —1 (is, Q) = j [(1 + cos 2 #) I (is, Q) — sin 2 # Q(is, Q) — 2 cos# V(is, 0)j 
X 00 (is,Q) = ^ sin 2 6 [i (is, Q) + Q (is, 

X 1 3 (is,£l) = ^ [(1 + cos 2 #) I (is, f2) — sin 2 # Q(is, Q) + 2 cos 6 V (is, £7)] 

X —1 q(*7 S2) — — t = sin # [— cos # 1 (is, Q) — cos # Q(is, £2) + i U(is, Q) + V (is, £})] e~ lx 

X_j 1 (is, £2) = j [sin 2 # 1 (is, £2) — (1 + cos 2 #) Q(is, £2) + 2i cos# U(is, £2)j e — 2lx 

Xq 1 (is, £2) = sin # [cos # 1 (is, £2) + cos # Q(is, £2) — i U (is, £2) + V (is, £2)] e~ lx 

where Q(i/, £2) = cos 27 Q(is, £2) — sin 27 U (is, £2) 

U (is, £2) = cos 27 U (is, £2) + sin 27 Q(is, £2) 



Finally, from the tensor X qq ,{y , ft) one can define the tensor J qq , (v) by averaging 
over the whole solid angle 

J,A V ) = / S VM) = /^E T 99 'M) S AA) ■ (5.153) 



This tensor depends on the polarized radiation held propagating in all directions, 
and will be called the ( reducible ) radiation field tensor. Its trace is given by (see 
Eq. (5.152)) 

E J qqA) = = J{v) , 

which is the usual definition of the mean intensity of the radiation held over the 
solid angle. From Eq. (5.151) it follows the conjugation property 



•V(0* = . W*')- ( 5 - 154 ) 

All the tensors defined so far are reducible tensors. Under rotation of the coor- 
dinate system the new components, T ' qq , , are obtained from the old ones, T pp , , by 
the transformation (see the derivation of Eq. (5.117)) 



T A = E t pp , , 

pp' 



where a, (3, 7 are the Euler angles of the rotation which brings the old reference 
system into the new one. 
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We can easily construct the associated irreducible tensors through the standard 
procedure (see Eq. (5.124)). By so doing, the following irreducible tensors are 
obtained 

££(<*,/?, 6), T£M), t£M), J*(v) 

corresponding to 

T qq '{h&), J qq '{v) , 



respectively. The relations between the irreducible tensors and the corresponding 
reducible ones are repeated here for the sake of completeness 



T Q =£(-l) I+ V3(2A- + l)Q j _q)t„. 

qq' V 7 

with the inverse transformation 




(5.155) 



(5.156) 



Obviously, the various irreducible tensors are connected with each other by the 
same relations which connect the corresponding reducible tensors, namely (cf. 
Eqs. (5.131), (5.150), and (5.153)) 

t£M)= E 

a/3=±l 

3 

2qM)= E 

a/3=± 1 i=0 

3 

Jq M = / ^ X Q M)= X Q (*’ «) O) ■ (5.1.57) 

i=0 

Their conjugation properties can be deduced from the corresponding properties of 
the reducible tensors, and are found to be 

^(a,/3,f2)* = (-l)Q55 Q (/3,a,!l) 

^(i,fS)* = (-l)«7*(i,ji) 

j£M)* = (-i)«t5 q M) 

JgM* = (-l)^-oM- (5-158) 

We rewrite here, for the sake of clarity, the expressions of the tensors Eq (a, 0, O) 
and Tg(i,Q) in terms of rotation matrices (see Eqs. (5.126), (5.134), and (5.135)) 

£$(a, 0 ,n) = y/ 3 ( 2 K+l)(j x _q/) Vq>q(R) (a ,0 = ±l); 



= ^pq(R) 

P 



(5.159) 
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Fig. 5. 15. Reference direction unit vector (e ) and associated unit vector (e b ) relative to the 
directions Cl and —Cl . e (Cl) is parallel to e (—Cl). 



with 




(5.160) 



where R is the rotation bringing the reference system (e ($2), e 6 ($2), O) into the 
system ( xyz ). 

One can easily deduce some remarkable properties of the coefficients tp(i ): 

a) The only non-zero components of tp(i) are those with P = 0, ±2, as the indices 
a and (3 can only take the values ±1. 



b) From Eq. (5.147) and from the properties of the 3 -j symbols it follows the 



conjugation relation 


*£(*)* = *-p(0- 




c) Defining the quantities 


^ = (1,1, 1,-1) 
7* = (1,1, -1,1) 






C* = (1,1, -1,-1) = tiP 


(5.161) 


with 7 = 0,..., 3, one gets 






i^i) — a— 0 0a > 


"Ca 

II 

P 




and hence 


t K P {i)=^) K t K p(i) 

tp(i) = T i t-p(i) 

t K P {i) = C i {-V K t K -p(i)- 


(5.162) 



These properties allow a simple derivation of the relation between the tensors 
Tq ( i , $2) and Tq(i, — fi), which will be needed in the following. 

Let us consider two opposite directions, Cl and —Cl, and the corresponding ref- 
erence direction unit vectors e a (Cl), e a (—Ci). We assume for simplicity that the 
directions e a ($2) and e a (— O) coincide (see Fig. 5.15). From the definition (5.159) 
we have 
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TABLE 5.5 

Values of the symbol tp (i) 



i 


K = 0 


K = 1 


I< = 2 


0 


tip, 0 


0 


75 s p, o 


1 


0 


0 


— (^P,-2 + ^>, 2 ) 


2 


0 


0 


* ~ir (^p, — 2 — ^>, 2 ) 


3 


0 


\f% &P, 0 


0 



T«(i,-n) = J2tp(i) 'Dpcji.R') i 

p 

where R' is the rotation bringing the system (e a (— U), e b (— U), — fi) into the system 
( xyz ). The rotation R' can be executed in two steps: first a rotation R 0 which 
brings the system (e a (— fi), e b (— fi), — U) into the system (e 0 (fi), e b (U), fi), and then 
a rotation R which brings this last system into (xyz). Thus from the closure 
property of the rotation matrices (Eq. (2.74)) we have 

= Y, E VpQ'W o) V Q'q( R ) ■ 

P Q> 

On the other hand, the rotation R 0 is given by 

R 0 = (7T,7T,0) , 

and since (see Eq. (2.70)) 

0) = e- i7rP d« QI (n) = e _i7rP (-1) A ' +P S Pi _ Q , , 

we obtain 

T«(i, — n) = Y tp(*) e_i " p (-1) K+P V-pq(R) ■ 

p 

Using property a) of the coefficients tp(i) (see above) and the last of Eqs. (5.162) 
we finally get 

T«(i,-n) = <; i T«(i,n). (5.163) 

It should be remarked that this relation is valid only when the directions e a (f2) 
and e a (— U) coincide. 

Let us turn to the evaluation of the irreducible tensor Tq (i,Q,). From Eqs. 
(5.160), (5.128), and (2.26) we can easily obtain the values of the coefficients tp(i), 
which are collected in Table 5.5. The explicit expressions for Tq ( i. Cl) can then 
be derived from the definition (5.159) by direct evaluation of the rotation matrices 
(Eq. (2.68) and Table 2.1). The results are contained in Table 5.6. 
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TABLE 5.6 



Explicit expression for the tensor Tq(i, Q). The components with negative Q- value can be ob- 
tained from the relation O) = (—1)^ Tq (i, O)*. The angles 7 are shown in Fig. 5. 14. 

Note that the rotation carrying the reference system (a, b, c) into ( xyz ) is R = (— 7 , —0, — x)- 



TO (0,Q) 


= 1 




7M) 


= 0 




7(0, 0) 


= 0 




7M) 


= 571 (3cos2e “ 1} 




7(0,0) 


= — sin 9 cos 8 e' x 




T|(0,Q) 


= sin 2 0 e 2lx 




7 7^) 


= 0 




7M) 


= 0 




7(i,q) 


= 0 




7M) 


= cos 27 sin 2 0 




7(1,0) 


= — ^ (cos 27 cos 8 4 


i sin 27) sin 6 e lx 


T| (1,0) 


= — [cos 27 (1 + cos 2 6 ) 4 2i sin 27 cos e 2lx 


7° M) 


= 0 




7(2,0) 


= 0 




7(2,0) 


= 0 




T q 2 (2,0) 


= —7 sin 27 sin 2 0 

2 v 2 




7(2,0) 


= (sin 27 cos 9 — i 


cos 27) sin 6 e lx 


7(2,0) 


= [sin 27(1 + cos 2 


9) — 2i cos 27 cos c 2lx 


7° (3,0) 


= 0 




7(3,0) 


= cos ® 




7(3,0) 


= — sin 9 e lx 




7(3,0) 


= 0 




7(3,0) 


= 0 




7(3,0) 


= 0 
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TABLE 5.7 



Explicit expression for the tensor Tq(v,Q,). The components with negative Q-value can be 
obtained from the relation X^q(v,VL) = (— 1)^ Tq(v, SI)*. 




The derivation of the explicit expression for the tensor Xq (v,Q,) defined in Eq. 
(5.157) is straightforward (see Table 5.7). Note that the expressions in Table 5.7 
- like those of Table 5.4 - contain only the linear combinations Q and U, which 
are independent of the reference direction chosen to define the Stokes parameters 
Q and U. 

Finally, the irreducible tensor of the radiation field Jq (y) is obtained by averag- 
ing Xq (v. fi) over the solid angle. In the particular case of an unpolarized radiation 
field having cylindrical symmetry about the 2 -axis only two components are not 
zero, namely 



^oV) = 7 (^ 0 ) 

J o( u ) = (3cos 2 6> - 1 )I{v,9) . (5.164) 

Obviously, the component Jq(p) is also zero if the radiation field is isotropic. 



5.12. Further Properties of the Scattering Phase Matrix 

The expression given in Sect. 5.10 for the scattering phase matrix and the various 
relations proved in Sect. 5.11 allow a further extension of the remarkable properties 
of this matrix. 

Starting from Eq. (5.133) and using Eq. (5.163) one can directly prove the fol- 
lowing relations (which are however valid only when the same reference directions 






MATTER-RADIATION INTERACTION (CLASSICAL) 



213 



are chosen for Cl and — Cl, and for Cl' and — £2') 

R ij (ti,-ti'-,B) = <; j R ij (ti,ti'-,B) 

The first relation means that if (/, Q, U, V) is the Stokes vector scattered along the 
direction Cl, then, irrespective of the polarization of the incident beam and of the 
intensity and direction of the magnetic field, the Stokes vector scattered along the 
direction — Cl is (/, Q, — U , — V). The two other relations have analogous meanings. 

Considering next Eq. (5.136), we can write the expression of the scattering phase 
matrix for the most general geometry of Fig. 5.9. The two relevant rotations are 
(see Eq. (5.122)) 

R = (-7,-0,-x) x (Xb^ b ,1b) 

R' = (-Y, -O', -xO x (Xb^B’Ib) 

where Xb is an arbitrary angle, and hence from Eq. (2.74) we obtain 
R ij {C\,Cl'-B) = 

= E *p(0 E V PQ (-7 ~ e -X) M$q, 0) V« p ,{-x! o' i) , (5.165) 

KPP' QQ> 

where we have introduced the magnetic kernel defined by 

■Mqq'(B) = E ^ QQ"(Xb 0 b 1b) v q"q\~1b ~0 B ~Xb) 1 + j Qiift ■ ( 5 - 166 ) 
Q" 

In terms of reduced rotation matrices (Eq. (2.68)) we have 

= e i(Q '- Q)x ° E d QQ"( 9 B ) d«, Q ,{-6 B ) 1 + } Q „ H . (5-167) 

which shows explicitly that the magnetic kernel is independent of the angle 7 B , 
that can be set equal to zero. 

The magnetic kernel satisfies several important properties which can be easily 
derived from Eqs. (5.166) and (5.167) using some properties of the rotation matri- 
ces: 

i) Limiting case of zero magnetic field 
Recalling Eqs. (5.84) and (2.74) we have 

Hm^A 4qq,(B) = 'Dqq"{Xb Ob 6) ^ > q"q i ( 0 ~0 B ~Xb) 

Q" 

= V« Q ,(000) = 6 QQ , , 
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and hence 

6'; 0)= £ t*(i) t«(jy V* P ,(R) , (5.168) 

KPP' 

where R is the rotation bringing the reference system (e„(f2), e h (d), Cl) into the 

system (e'(Q'), %(&),&)■ 

ii) Limiting case of strong field, 

/]— ((c MqQ’ (R ) = RQoiXB ®B 0) R*0 Q'(0 —®B ~ Xb) ■ 



iii) Conjugation property 

Mq Q '(B)* = (-1 f-Q'M« Q _ Q ,(B) . 

iv) Magnetic field symmetries 

■Mqq'(B, tt 0 b , — Xb) = M^q-q' (B, 9 b , Xb) 

Mqqi{B, tt — 9 Bl tt — Xb) = ( — 1)^ ® ■^■-q_q'{B,9 b ,Xb) 
Mq Q i(B,6 b ,tt + Xb ) = (-1 ) Q ~ Q ' M% qi (B, 9 b , Xb ) , 

a set of formulae giving the transformation of the magnetic kernel under a 180° 
rotation of the magnetic field vector about the axes x, y , and z of Fig. 5.9, respec- 
tively. 

v) Magnetic field reversal 



M« q ,{-B) = 

= v QQ"^+Xb v-9 b 0) V%„ QI (0 -7 t+9 b tt Xb) 1 + ^ Q „ H 
Q " 

— £ 25 QQ"(Xb 9b °) -9 B “Xb) 1 _ i Q"H ’ 



(5.169) 



which shows that the reversal of the magnetic field can also be obtained by the 
formal substitution H — > —H. 

vi) Average over an isotropic distribution of magnetic fields 

For an isotropic distribution of magnetic field vectors we can find the average value 
of the magnetic kernel. Defining 



2tt 7 r 

(M« qi (B)) = -LJd XB J sinf 1 B M« Q ,(B)d9 B 
0 0 



(5.170) 
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we easily obtain, with the help of Eqs. (2.71) and (2.76) 

(■MqQ’(B)) = Sqq , , 

where the quantities n K are given by 



(5.171) 



— 



y — - 

4^ 1 + i< 



2K+1 j-f 1 + i Q"H 



or, explicitly 
1*0 = 1 



Mi = ^ (l + 2cos 2 a 1 ) = i 



1 + 



1 + H 2 
1 



H 2 = -y (l + 2 cos 2 aq + 2 cos 2 ai 2 ) = 

0 o 



1 + 



1 + H 2 1 + 4 H 2 



(5.172) 



(5.173) 



the angles a 1 and a 2 being defined in Eq. (5.98). 

This result shows that the average of the scattering phase matrix in an isotropi- 
cally distributed magnetic field can be obtained directly from the scattering phase 
matrix of the non-magnetic case by multiplication of the various multipole compo- 
nents by the quantities /j k , 



H'; B))=y H'; 0) . (5.174) 

K 



In particular, in the limiting case of strong magnetic fields the matrix i? (1) is 
reduced by a factor 3, while the matrix R ,2> is reduced by a factor 5. 



5.13. Understanding Scattering Experiments 
through Oscillator Models 

In the previous sections we have derived the expression for the scattering phase 
matrix in the presence of a magnetic field both in terms of direction cosines and 
in terms of rotation matrices. However, these derivations are rather involved and 
there is a danger that the physical meaning of the various results may be hidden 
by the mathematical formalism. Thus we think it is worthwhile to present some 
qualitative arguments - based on the atomic oscillator model - aimed at clarifying 
the underlying physics. At the same time we will draw some interesting analogies 
between the classical and quantum-mechanical descriptions of scattering phenom- 
ena. 

Let us consider a simple scattering event like that illustrated in Fig. 5.16a, left. 
The incident radiation beam is unpolarized (conventionally, this is represented by 
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two double-ended, perpendicular arrows which should be regarded as incoherent 
electric field oscillations), and there is no magnetic field. We schematize the scat- 
tering atom as a collection of three linear oscillators of frequency v 0 that will be 
called the x, y, and ^-oscillator, respectively. The scattering experiment can be 
described in the following way. 

i) The electric field of the incident radiation can be decomposed into its x and 
z components; owing to the unpolarized character of the radiation, the two com- 
ponents are incoherent, or, in other words, there are no phase relations between 
them. 

ii) The x component of the electric field excites the rc-oscillator, while the z compo- 
nent excites the z-oscillator (the latter is represented by a double-ended arrow in 
the right part of the figure) . The incoherent character of the electric field is simply 
transferred to the oscillators. 

iii) The oscillators decay with a damped motion and each of them emits, in any 
given direction, a radiation beam polarized according to the well-known laws of 
classical electrodynamics (see Eq. (3.29)). If we consider the beam scattered in 
the x direction, it is easily seen that the s-oscillator is ineffective (being pole-on), 
while the z-oscillator produces a radiation beam that is linearly polarized along the 
z direction. As a consequence, the radiation scattered in the x direction is 100% 
linearly polarized perpendicularly to the scattering plane, which is consistent with 
the results derived in Sect. 5.9 (see Eq. (5.103)). 

To investigate the effect of the magnetic field it is convenient to describe the 
model atom by a different set of oscillators, namely the linear a;-oscillator plus two 
circular oscillators with opposite directions (called a + and er_) laying in the y-z 
plane as shown in Fig. 5.16b, left. If the magnetic field is zero, the frequency of 
the three oscillators is still v 0 , and we must find the same results as above. 

Following the same line of reasoning, we see at step ii) that while the x component 
of the electric field still excites the x-oscillator, the 2 component excites the cr + and 
<j_ oscillators in such a way as to produce a well-defined phase relation between 
them, because the resulting motion of the electric charge takes place along the 
z direction (Fig. 5.16b, right). For the scattered radiation we obviously have the 
same result as before, but we must bear in mind that this new description of 
the atomic system has forced us to introduce the concept of phase relation (or 
coherence) between the two circular oscillators. 

In the presence of a magnetic field directed along the x-axis (see Fig. 5.16c) 
the circular oscillators vibrate at different frequencies ((y 0 + v L ) and (y 0 — is L ), 
respectively, with zz L the Larmor frequency) . Thus the phase relation produced by 
the exciting electric field is lost little by little during the damped decay process. The 
electric charge describes in the y-z plane a complicated pattern (sometimes called 
a ‘rosette’ - Fig. 5.16c, right), and thus the polarization of the scattered radiation 
- which reflects the weighted average of the pattern - is decreased and rotated 
from the direction of the non-magnetic regime: in Fig. 5.16c (left) the scattered 
beam is the superposition of a linearly polarized beam plus an unpolarized beam. 
The shape of the rosette is controlled by the parameter H = 2-kv^/^ defined in 
Eq. (5.84). When the value of H is very large ( H 1) the rosette degenerates 
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into a different curve (that we will rather call a ‘daisy’, see Fig. 5.16d right) and 
the scattered radiation is totally unpolarized. 

This simple example shows that the effect of the magnetic field is to cause a 
relaxation of the phase relations (or coherences) between the different oscillators: 
the coherence between the cr + and a _ oscillators is maximum for zero magnetic field 
(Fig. 5.16b) and gradually decreases with increasing field strength (Fig. 5.16c,d). 
The Hanle effect is just the consequence of this relaxation process in resonance 
scattering. 

Different geometrical configurations can be envisaged to get a deeper insight into 
the phenomena of resonance polarization and the Hanle effect. Figure 5.17 presents 
an example of forward scattering; here the scattered beam is unpolarized for zero 
magnetic field and partially linearly polarized along the field’s direction when the 
magnetic field is present. The discussion of this case, that can be carried out along 
the same lines as before, is left to the reader as an exercise (cf. Eqs. (5.106)). 

The classical model that represents the atom as a collection of one linear os- 
cillator and two circular oscillators of opposite directions has a simple quantum 
analogue. This is the two-level atom having a lower level of angular momentum 
J = 0 and an upper level of angular momentum ./' = 1. The excitation of one of 
the classical oscillators induced by the electric field of the incident beam has its 
quantum equivalent in the excitation of one of the Zeeman sublevels of the upper 
level. In this analogy, the linear oscillator corresponds to the sublevel M' = 0, while 
the circular oscillators of frequency (u 0 ± p l ) correspond to the sublevels M' = ±1, 
respectively. 

In scattering experiments, apart from special geometrical configurations, the var- 
ious oscillators are excited unevenly by the electric field of the incident beam, and 
well-defined phase relations arise between different oscillators. This means - in the 
language of Quantum Mechanics - that the upper level of the atom is polarized 
(see Sect. 3.6); in other words, the diagonal density-matrix elements 
are different from each other and the off-diagonal elements Pj,(M',M") are non- 
zero. In particular, the amount of excitation present in a given oscillator has its 
quantum analogue in the corresponding diagonal element of the density matrix, 
while the combined excitation of two different oscillators with a well-defined phase 
relation has its analogue in the off-diagonal element of the density matrix between 
the two corresponding sublevels. For example, in the case of panel (b) of Fig. 5.16, 
the non-zero density-matrix elements are p (0, 0), p (1, 1), p (—1, —1), p (1, —1), and 
p (— 1, 1), the last two elements being related to the coherence between the circular 
oscillators. Passing to panels (c) and (d) of the same Figure, the diagonal ele- 
ments remain unchanged, while the off-diagonal ones are reduced by the presence 
of the magnetic field. In the following of this book we will see that these qualita- 
tive concepts are fully confirmed by a more rigorous treatment based on Quantum 
Mechanics. 
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5.14. The Role of Collisions 

We present in this section a simple theory to describe the effect of collisions on 
polarization phenomena. Both the atom and the colliding particles will be described 
in terms of classical physics; in particular, the atom will be schematized as a three- 
dimensional oscillator, like in the previous sections of this chapter. 

Our first assumption is that the interactions with the closest perturbing particle 
(binary interactions) play the principal role. In other words, we assume that triple 
and multi-particle interactions can be neglected, which implies a limitation on the 
density n of perturbing particles in the medium, 

(T^ n<l 

where a is the typical collisional cross-section. This inequality is always well- 
satisfied both in laboratory plasmas and in stellar atmospheres. 

Next we distinguish between two different kinds of collisions, exciting collisions 
and perturbing collisions, defined according to the following simple model. 

a) Exciting collisions are inelastic, which implies an energy transfer from the col- 
liding particle to the atom. When an exciting collision takes place, the oscillator is 
abruptly forced to interrupt its previous motion, described by Eq. (3.23), and starts 
oscillating again with a new set of initial conditions. These initial conditions de- 
pend on the efficiency of the collision (which sets the amplitude of the oscillation), 
and on the geometry of the impact event (which sets the polarization characteris- 
tics of the oscillation). The geometry of the event is specified by the direction of 
the colliding particle, the direction of the vector joining the center of the oscillator 
with the center of the colliding particle at the moment of closest approach, and on 
the orientation of the colliding particle if this is not spherically symmetrical. 

If the colliding particles have an isotropic distribution of velocities, and - in the 
case of oriented particles - if they have random orientations, the initial conditions 
set up in the oscillator will be isotropic as well. Employing the same notations as 
in Sect. 3.2, the initial values of the amplitudes of the oscillations will be given by 

(KAp) = S afi W , (5-175) 

where the symbol (. . .) means a statistical average over all the possible collisions. 
This is just the physical situation described in Sect. 3.2. We remind the reader that 
exciting collisions, as defined here, have their quantum counterpart in collisions able 
to induce transitions between atomic levels. 

In different cases, like for instance when the atom is bombarded by a collimated 
and/or oriented beam of particles, Eq. (5.175) does not hold and, as a result, the 
atom emits polarized radiation in the decay process. This phenomenon is known 
under the name of impact polarization , a subject that will not be deepened here. 

b) Perturbing collisions, on the contrary, are elastic collisions which are not able to 
interrupt the oscillatory motion of the atom and to make it restart from scratch, 
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but are effective in shifting the phase of the oscillation during a time interval t c 
that is called the collision time. 

Following a model that was proposed by Lorentz (1915) and refined by Weisskopf 
(1932) and Lenz (1933), we suppose that the collision time t c is much smaller than 
the other typical times involved in the physical process, namely the decay time 
of the oscillator l/y and the mean time between collisions 1//, where / is the 
frequency of perturbing collisions 1 

t < - 3 t c <C \ . (5.176) 

7 / 

This hypothesis allows to treat the collisions as instantaneous ( impact approxima- 
tion). We further assume that each perturbing collision produces a large phase 
shift in the atomic oscillation, so that the oscillations before and after the collision 
are totally uncorrelated ( strong collision hypothesis). These two assumptions are 
at the basis of the usual treatment of collisional broadening in spectral lines (see 
e.g. Sobel’man, 1972) and can also be used to derive the effect of collisions on 
polarization phenomena. 

To this aim, however, the second assumption must be suitably specified to de- 
scribe with greater accuracy the geometrical details of the collisional event. We 
suppose that each collision, characterized by its own geometry, induces an indepen- 
dent phase shift in each of the three components of the oscillatory motion along a 
triplet of real, orthogonal unit vectors whose orientation is fixed by the direction of 
the velocity of the colliding particle (and by its orientation, in case of an oriented 
particle). These unit vectors will be denoted in the following by the symbol t t 
[i = 1,2,3), and the corresponding phase shifts by <f> Now we are going to derive 
the consequences of this simplified model on the results obtained in the previous 
sections of this chapter. 

i) Radiative Transfer Equations. 

In the derivation of the radiative transfer equations for polarized radiation in the 
presence of a magnetic field (Sect. 5.3) we found in Eqs. (5.29) and (5.30) the 
relation between the amplitude of the oscillation and the corresponding component 
of the forcing electric field. This relation is based on a particular solution of the non- 
lromogeneous equation (5.28) which disregards the transient solution depending 
on the initial conditions. In the presence of collisions the transient solution is 
important, and we take it into account in the following way. 



Obviously the frequency / can be written in the form 

f = N p v r a , 

where N p is the number density of perturbers, v r is the average velocity of the perturbers relative 
to atoms, and a is the cross-section. Expressing AT p in cm -3 , v r in km s — and a in units of clq 
(a Q being the Bohr radius), we have 

/ = 2.80 X icr 12 N p v T a s _1 . 
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Let t = 0 indicate the time at which the first collision takes place and suppose 
the previous collision was sufficiently far in time for its transient to be already 
damped (statistically, there will always be such a collision and nothing prevents us 
from calling it the ‘first’ collision). Employing the same notations as in Sect. 5.3, 
and making the same approximation (y g — v 2 ) as 2 v (p 0 — v), we can write 



x (t = 0_) 



\ ' 



87 \ 2 mv ' (y 0 — 1 



-v)-\r Ea u<x 



(5.177) 



Expansion on the unit vectors t- gives 



! (t = 0_) = - 



E 



1 



87 T 2 mu ^ (u 0 — av L — is) — i r 



TF E a D ai h - 



where the direction cosines D ai are defined by 1 

D ai = u a ■ t* . (5.178) 

According to the above assumptions, the effect of the collision is such that 

e n 1 



: (t = 0.) = - 



0 \ ^ 2 7? T~) p^i f 

+J ~ 8 ^mv ^ (v 0 - ais L - v) - ST “ ai * 



and returning to the basis u c 



'(< = «+) = - 



Ettv : 



8 ir 2 mv ^ (v n — aw — v) — \r 

a /3 v u L ’ 



E, 



[E 



D m e ■"* rr Pl 



tip. (5.179) 



To obtain the motion of the electron after the collision we must solve Eq. (5.28). 
Its general solution is the following 



x{t) 



e Q 1 7-1 — 27rii^t _» 

87 i 2 mv (y Q — / 3 u l — v) —1 r 13 1 



+ 



e -27ri(i/ 0 -/3^)t 

0 



where the constants Cp must be determined in such a way as to satisfy the boundary 
condition (5.179). Hence we obtain 

0/3 = “87r 2 mp{ E (^ _ ^ 1 „) _ LT [E e ‘ ^ 

K - - ir i ' 

1 The quantity Z) . could also be written as u a • as we have supposed the unit vectors t i to 
be real. However, we adopt the definition (5.178) to make easier a possible generalization of our 
hypotheses on the effects of collisions. 
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Now we perform a statistical average over collisions. Since the phases 4> i are arbi- 
trary large numbers (strong collision hypothesis), we have 

(e^) = 0 (* = 1,2,3). 



The first term in curly brackets vanishes and we obtain 



?(*) = - 



87 T 2 mv 



n 

(3 



1 — e 



— 27 ri (^q— is) t — -rgt~\ ft ^ 



— 2i^ivt 



H - P u l -v)-i r 



i 0 > 



so that (cf. Eq. (5.30)) 



, — 27ri(i/„ — 3i* T — v)t —-kt 

e 0 1 — e 0 L e 2 
^ /3 ^ S>TT 2 mu (z/ Q — (3v h — v) — ir 

or, in other words, an electric susceptibility variable with time. The average value of 
the electric susceptibility can then be obtained by multiplying its value at time t by 
the probability that no collision occurred in the interval (0, t) . If / is the frequency 
of collisions, we have 1 



<*«<*» = //»'%« dt = „) _ i r • 

0 

where 

r' = r + 2r c = i + T, (5.180) 

with 2 

C = i ■ (5-181) 

Comparison with Eq. (5.30) shows that the effect of perturbing collisions consists 
in a broadening of both the absorption profile and the associated dispersion profile. 
The same result could be formally obtained by the addition of a supplementary 
friction force in the equation of motion of the oscillator. 



The probability that a collision occurs in the interval (t, t + dt) is given by 

dp = /dt . 

Dividing the interval (0, t) into a large number N of equal parts, the probability that the first 
collision occurs in the interval (t, t + dt) is given by (see Eq. (2.62) for a similar derivation) 

dP = lim ( 1 — / — ) fdt=fe- ft dt. 

iV— *00 5 N ' 

The definition r = // 4tt (instead of f/2ir) leads to simpler forms for the expressions that 
will be encountered in the following. 
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ii) Zeeman Effect, Resonance Polarization and Hanle Effect. 

By a similar reasoning we can find the effect of perturbing collisions on the polar- 
ization of the radiation emitted by an atom in a decay process. 

We suppose that our model oscillator is excited at time t = t 0 = 0 either by an 
exciting collision or by the radiation field, and we suppose that the first collision 
takes place at time f l5 the second at time f 2 , and so on. In the interval (t Q , t x ) the 
motion of the oscillator is described by Eq. (3.23), which has the solution 

x a (t) = A a e ~jt (o = t 0 <t<t 1 ), (5.182) 

where A a are the initial amplitudes set up by the exciting mechanism. At time t 1 
the oscillator undergoes the first collision, which is characterized by its particular 
geometry (specified by the three unit vectors tW and by the corresponding phase 
shifts 0( 1> ). Along the same lines leading from Eq. (5.177) to Eq. (5.179) one can 
write for the time interval (t 1 ,t 2 ) 

x Jt) = A 6 K sl e ~ 2 " l(v °~ ai ' L)t e~ 7 * (*!<*< t 2 ) , 

s 

where A'))) , the collisional kernel due to the first collision, is given by 



(i) _ -2m(a-S) v L t 1 
^5a ~ e 



[E e'*"’ D«’ 



(5.183) 



with the direction cosines D (V defined as in Eq. (5.178). Similarly, for the time 
interval (t n ,t n+1 ) one has 



X a (t)= A sKl K %- 

Seip...p 



-27ri(^-a^ L )* 

1Y pa c 




(t n <t < t n+1 ) . 



To shorten notations we define the cumulative collisional kernel due to the first 
n collisions as 

K£= E K£K%...K£ (5.184) 

e<p...p 

with 

Zap = Kl = S a p , (5.185) 

and we can write 

x a(t) =J2 A s e ° !I ' lj)t e (t n <t< t n+1 ) , 

6 



a general formula which includes Eq. (5.182). 
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The function x a (t) can be expanded in Fourier series, 

OO 

x a {t) = J T a (v) e~ 2nwt Au , 

— OO 



where 1 



F a (y) = / *qW e ™ At = E 



(«) [ ~[ 27ri K-^L-^) + ?]* 



d£ 



= EE^ : 



n — 0 <5 



n— 0 (5 



-AT t -N t , . 

0 an 0 a n + 1 

AL 



(5.186) 



with 



iV a = 27ri (i/ 0 - ai/ L - i/) + | = 27ri [(z/ 0 - ar- L - i/) - LT] . (5.187) 



To investigate how the Zeeman effect and resonance polarization are affected 
by perturbing collisions we must start from Eqs. (3.34) and (5.75), respectively. 
Owing to the different definition of the Fourier transforms F a {y) and F (y), these 
two equations must be rewritten in the form 



E c* c 0j 7 / " 4 ^»* Fp(y) & (s.iss) 

— OO 

and 

OO 

dz^.(o) = ^ do Y, c «i c 0j 7 A' 4 ?JyT W d * > (s.isq) 

a 0 — OO 

respectively. 

From Eq. (5.186) we have 
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(5.190) 



The Fourier transform ^ (y) differs from F (y) defined in Eq. (3.32) for the presence of the 
amplitudes A s . In the special case of no collisions = 0, /C^ = S a p) we have 

T{y) = A„F(v) . 

CK V / CK CK V / 




